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Background of the Invention 
Field of the Invention 

15 The present invention relates to isolated nucleic acid sequences 

encoding polypeptides having transcriptional activation activity and 
to the polypeptides. The invention also relates to nucleic acid 
constructs, vectors and host cells comprising the nucleic acid 
sequences. The invention further relates to host cells useful for the 

20 production of polypeptides in which the production or function of the 
transcriptional activator has been altered, as well as to methods for 
producing the polypeptides. 

Description of the Related Art 

25 The use of recombinant host cells in the expression of 

heterologous proteins has in recent years greatly simplified the 
production of large quantities of commercially valuable proteins which 
otherwise are obtainable only by purification from their native sources. 
Currently, there is a varied selection of expression systems from which 

30 to choose for the production of any given protein, including eubacterial 
and eukaryotic hosts. The selection of an appropriate expression system 
often depends not only on the ability of the host cell to produce 
adequate yields of the protein in an active state, but, to a large 
extent, may also be governed by the intended end use of the protein. 

35 One problem frequently encountered is the high level of 

proteolytic enzymes produced by a given host cell or present in the 



culture medium. One suggestion has been to provide host organisms 
deprived of the ability to produce specific proteolytic compounds. For 
example, WO 90/00192 (Genencor, Inc.) describes filamentous fungal hosts 
incapable of secreting enzymatically active aspartic proteinase. EP 574 
5 347 (Ciba Geigy AG) describes Aspergillus hosts defective in a serine 
protease of the subtilisin-type . WO 98/12300 (Novo Nordisk A/S) 
describes hosts defective in a metalloprotease and an alkaline protease. 
WO 97/12045 (Genencor, Inc.) describes yeast and bacterial host systems, 
which are rendered protease deficient resulting from a disruption of a 

10 promoter sequence involved in the regulation of a protease gene. 

Mattern, I.E., et al . , (1992. Mol Gen Genet 234:332-336) describe 
a mutant strain of Aspergillus niger, which was shown to have only 1 
to 2% of the extracellular protease activity of the parent strain, 
apparently due to a deficiency of at least two proteases, 

15 aspergillopepsin A and aspergillopepsin B. It was suggested that the 
protease deficient phenotype could result from a regulatory mutation 
affecting the expression of the genes coding for both proteases. 

The initiation of eukaryotic transcription at a specific promoter 
or set of promoters requires a eukaryotic transcriptional activator 

20 which is a polypeptide, but which is not itelf part of RNA polymerase. 
Many transcriptional activators bind to a specific site on the 
promoter to form a functional promoter necessary for the initiation of 
transcription of the polypeptide encoding sequence. However, a 
transcriptional activator may also be incorporated into an initiation 

25 complex only in the presence of other polypeptides. Polypeptides with 
transcriptional activation activity have been described in fungi, and 
a list of such polypeptides has been published (Dhawale, S.S., and 
Lane, A.C. 1993. Nucleic Acid Research 21:5537-5546). 

It is an object of the present invention to provide improved 

30 methods for increasing production of polypeptides in host cells in 
which the activity of a transcriptional activator involved in the 
regulation of protease production has been modified. 
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Summary of the Invention 

A first aspect of the present invention . relates to an isolated 
nucleic acid sequence encoding a transcriptional activator selected 
from the group consisting of : 
5 (a) a nucleic acid sequence having at least 70% identity with 

the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48; 

(b) a nucleic acid sequence encoding a polypeptide having an 
amino acid sequence which has at least 50% identity with the amino 
acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49; 

10 (c) a nucleic acid sequence which hybridizes under low 

stringency conditions with (i) the nucleic acid sequence of SEQ ID NO: 
1 or SEQ ID NO: 48, or (ii) its complementary strand, wherein the low 
stringency conditions are defined by prehybridization and 
hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 microg/ml sheared and 

15 denatured salmon sperm DNA, and 25% formamide, and wash conditions are 
defined by 50°C for 30 minutes in 2X SSC, 0.2% SDS; 

(d) an allelic variant of (a) , (b) , or (c) ; 

(e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has transcriptional 

20 activation activity; and 

(f ) a subsequence of (a) , (b) (c) , or (d) , wherein the 
subsequence encodes a polypeptide with the amino acid sequence of SEQ 
ID NO: 3. 

The nucleic acid sequence shown in SEQ ID NO: 1 is the 
25 Aspergillus niger prtT gene encoding the transcriptional factor shown 
in SEQ ID NO: 2 as described further below and in the Examples. 

The nucleic acid sequence shown in SEQ ID NO: 48 is the 
Aspergillus oryzae IF04177 prtT gene encoding the transcriptional 
factor shown in SEQ ID NO: 49. The A. oryzae prtT gene has a coding 
30 region starting in position 795 and ending at position 2931. The prtT 
gene has 4 introns in positions 1028-1135, 1538-1591, 2018-2066, and 
2297-2347, respectively. This is described further below and in the 
Examples . 

In another aspect, the invention also relates to nucleic acid 
35 constructs, vectors and host cells comprising the nucleic acid 
sequences, and to the polypeptides encoded by the nucleic acid 
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sequences. The invention further relates to host cells useful for the 
production of a polypeptide, in which the production or function of 
the transcriptional activator has been altered, as well as to methods 
for producing the polypeptide. 

5 

Brief Description of the Figures 

Figure 1 shows a restriction map of the plasmid pPAP, the 
construction of which is described in Example 1. 

Figure 2 shows a restriction map of the plasmid pAopyrGcosArpl, 
10 the construction of which is described in Example 1. 

Figure 3 shows a restriction map of the plasmid pEESl, the 
construction of which is described in Example 1. 

Figure 4 shows a restriction map of the plasmid pDprt, the 
construction of which is described in Example 3 . 
15 Figure 5 shows a restriction map of the plasmid pGPprt, the 

construction of which is described in Example 4 . 

Figure 6 shows the sequence of the insert in the two plasmids 
containing the PCR fragment of the A. oryzae prtT Zn 2+ -finger. ICA217 
is the sequence from one of the plasmids and ICA218 is the sequence 
20 from the other. 

Figure 7 shows plasmid pDV8 a pSP65 (Promega™) based plasmid 
containing the HSV-tk gene on a 1.2 kb Bglll/BamHl fragment inserted 
between a 1.0 kb Xhol/Bglll fragment of the A. nidulans gpd promoter 
and a 0.8 kb BamHI/Hindlll fragment containing the A. nidulans trpC 
25 transcriptional terminator. 

Figure 8 shows the construction of pJaL554 described in Example 

8 . 

Detailed Description of the Invention 
30 Nucleic Acid Sequences Encoding Transcriptional Activators 

A first aspect of the present invention relates to an isolated 
nucleic acid sequence encoding a transcriptional activator selected 
from the group consisting of: 

(a) a nucleic acid sequence having at least 70% identity with 
35 the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48; 
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(b) a nucleic acid sequence encoding a polypeptide having an 
amino acid sequence which has at least 50% identity with the amino 
acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49; 

(c) a nucleic acid sequence which hybridizes under low 
5 stringency conditions with (i) the nucleic acid sequence of SEQ ID NO: 

1 or SEQ ID NO: 48, or (ii) its complementary strand, wherein the low 
stringency conditions are defined by prehybridization and 
hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 micro g/ml sheared and 
denatured salmon sperm DNA, and 25% formamide, and wash conditions are 
10 defined by 50°C for 30 minutes in 2X SSC, 0.2% SDS; 

(d) an allelic variant of (a) , (b) , or (c) ; 

(e) a subsequence of (a) , (b) , (c) , or (d) , wherein the 
subsequence encodes a polypeptide fragment which has transcriptional 
activation activity; and 

15 (f ) a subsequence of (a) , (b) , (c) , or (d) , wherein the 

subsequence encodes a polypeptide with the amino acid sequence of SEQ 
ID NO: 3. 

The term "transcriptional activator" as used herein refers to a 
polypeptide which has the capability to activate a specific promoter 

20 or set of promoters necessary for the initiation of transcription of 
the polypeptide encoding sequence to which it is linked. 

The term "isolated nucleic acid sequence" as used herein refers 
to a nucleic acid sequence which is essentially free of other nucleic 
acid sequences, e.g., at least about 20% pure, preferably at least 

25 about 40% pure, more preferably at least about 60% pure, even more 
preferably at least about 80% pure, most preferably at least about 90% 
pure as determined by agarose electrophoresis. For example, an 
isolated nucleic acid sequence can be obtained by standard cloning 
procedures used in genetic engineering to relocate the nucleic acid 

30 sequence from its natural location to a different site where it will 
be reproduced. The cloning procedures may involve excision and 
isolation of a desired nucleic acid fragment comprising the nucleic 
acid sequence encoding the polypeptide, insertion of the fragment into 
a vector molecule, and incorporation of the recombinant vector into a 

35 host cell where multiple copies or clones of the nucleic acid sequence 
will be replicated. The nucleic acid sequence may be of genomic, 
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cDNA, RNA, semisynthetic, synthetic origin, or any combinations 
thereof . 

In a preferred embodiment, the nucleic acid sequence has a degree 
of identity to the nucleic acid sequence set forth in SEQ ID NO: 1 or 
5 SEQ ID NO: 48 of at least about 70%, preferably at least about 80%, 
more preferably at least about 85%, even more preferably at least 
about 90%, most preferably at least about 95%, even most preferably at 
least about 97%, and even more preferred at least 99% identity, which 
encodes an active polypeptide. For purposes of the present invention, 
10 the degree of identity between two nucleic acid sequences is 
determined by the Clustal method (Higgins, 1989, CABIOS 5:151-153) 
with an identity table, a gap penalty of 10, and a gap length penalty 
of 10. 

In an even more preferred embodiment, the nucleic acid sequence 

15 encoding a transcriptional activator has a nucleic acid sequence as 
set forth in SEQ ID NO: 1 or SEQ ID NO: 48. 

Modification of a nucleic acid sequence encoding a polypeptide of 
the present invention may be necessary for the synthesis of 
polypeptides substantially similar to the polypeptide. The term 

20 ''substantially similar" to the polypeptide refers to non-naturally 
occurring forms of the polypeptide. These polypeptides may differ in 
some engineered way from the polypeptide isolated from its native 
source. For example, it may be of interest to synthesize variants of 
the polypeptide where the variants differ in specific activity, 

25 binding specificity and/or affinity, or the like using, e.g., site- 
directed mutagenesis. The analogous sequence may be constructed on 
the basis of the nucleic acid sequence presented as the polypeptide 
encoding part of SEQ ID NO: 1 or SEQ ID NO: 48, e.g., a subsequence 
thereof, and/or by introduction of nucleotide substitutions which do 

30 not give rise to another amino acid sequence of the polypeptide 
encoded by the nucleic acid sequence, but which corresponds to the 
codon usage of the host organism intended for production of the 
enzyme, or by introduction of nucleotide substitutions which may give 
rise to a different amino acid sequence. For a general description of 

35 nucleotide substitution, see, e.g., Ford et al . , 1991, Protein 
Expression and Purification 2: 95-107. 
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In another preferred embodiment, the present invention relates to 
isolated nucleic acid sequences encoding polypeptides having an amino 
acid sequence which has a degree of identity to the amino acid 
sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 49 of at least about 
5 50%, preferably at least about 60%, preferably at least about 70%, 
more preferably at least about 80%, even more preferably at least 
about 90%, most preferably at least about 95%, even most preferably at 
least about 97%, and even more preferred at least 99%, which 
qualitatively retain the transcriptional activation activity of the 

10 polypeptides (hereinafter "homologous polypeptides") . 

In a preferred embodiment, the homologous polypeptides have an 
amino acid sequence which differs by five amino acids, preferably by 
four amino acids, more preferably by three amino acids, even more 
preferably by two amino acids, and most preferably by one amino acid 

15 from the amino acid sequence set forth in SEQ ID NO: 2 or SEQ ID NO: 
49. For purposes of the present invention, the degree of identity 
between two amino acid sequences is determined by the Clustal method 
(Higgins, 1989, supra) with an identity table, a gap penalty of 10, 
and a gap length penalty of 10 . 

20 Hybridization indicates that by methods of standard Southern 

blotting procedures, the nucleic acid sequence hybridizes to an 
oligonucleotide probe corresponding to the polypeptide encoding part 
of the nucleic acid sequence shown in SEQ ID NO: 1, under low to high 
stringency conditions (i.e., prehybridization and hybridization at 

25 42°C in 5x SSPE, 0.3% SDS, 200 micrograms/ml sheared and denatured 
salmon sperm DNA, and either 25, 35 or 50% formamide for low, medium 
and high stringencies, respectively) . In order to identify a clone or 
DNA which is homologous with SEQ ID NO: 1 or SEQ ID NO: 48, the 
hybridization reaction is washed three times for 30 minutes each using 

30 2X SSC, 0.2% SDS preferably at least 50°C, more preferably at least 
55°C, more preferably at least 60°C, more preferably at least 65°C, 
even more preferably at least 70°C, and most preferably at least 75°C. 

The nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48, or a 
subsequence thereof, as well as the amino acid sequence of SEQ ID NO: 

35 2 or SEQ ID NO: 49, or a partial sequence thereof, or the amino acid 
sequence of SEQ ID NO: 3, may be used to design an oligonucleotide 
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probe to identify and isolate or clone a homologous gene of any genus 
or species according to methods well known in the art. 

In particular, such probes can be used for hybridization with the 
genomic or cDNA of the genus or species of interest, following 
5 standard Southern blotting procedures, in order to identify and 
isolate the corresponding gene therein. Such probes can be 
considerably shorter than the entire sequence, but should be at least 
15, preferably at least 25, and more preferably at least 40 
nucleotides in length. Longer probes can also be used. Both DNA and 
10 RNA probes can be used. The probes are typically labeled for 
detecting the corresponding gene (for example, with 32 P, 3 H, 35 S, 
biotin, or avidin) . For example, molecules to which a 32 P-, 3 H- or 35 S- 
labelled oligonucleotide probe hybridizes may be detected by use of X- 
ray film. 

15 Thus, a genomic, cDNA or combinatorial chemical library prepared 

from such other organisms may be screened for DNA which hybridizes 
with the probes described above and which encodes a polypeptide with 
transcriptional activation activity. Genomic or other DNA from such 
other organisms may be separated by agarose or polyacrylamide gel 

20 electrophoresis, or other separation techniques. DNA from the 
libraries or the separated DNA may be transferred to and immobilized 
on nitrocellulose or other suitable carrier material. A clone or DNA 
which is homologous to SEQ ID NO: 1 or SEQ ID NO: 48 may then be 
identified following standard Southern blotting procedures. 

25 An allelic variant denotes any of two or more alternative forms 

of a gene occupying the same chomosomal locus. Allelic variation 
arises naturally through mutation, and may result in phenotypic 
polymorphism within populations. Gene mutations can be silent (i.e., 
no change in the encoded polypeptide) or may encode polypeptides 

30 having altered amino acid sequences. 

The term ''allelic variant of a polypeptide" is a polypeptide 
encoded by an allelic variant of a gene. In a preferred embodiment, 
the nucleic acid sequence encoding a transcriptional activator of the 
present invention is an allelic variant of a nucleic acid sequence 

35 selected from the group consisting of nucleic acid sequences: (a) 
having at least 70% identity with the nucleic acid sequence of SEQ ID 
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NO: 1 or SEQ ID NO: 48, (b) encoding a polypeptide having an amino 
acid sequence which has at least 50% identity with the amino acid 
sequence of SEQ ID NO: 2 or SEQ ID NO: 49, (c) which hybridizes under 
low stringency conditions with the nucleic acid sequence of SEQ ID NO: 
5 1 or SEQ ID NO: 48, or its complementary strand, and (d) encoding a 
polypeptide having the amino acid sequence of SEQ ID NO: 3. 

The present invention also encompasses nucleic acid sequences 
which differ from SEQ ID NO: 1 or SEQ ID NO: 48 by virtue of the 
degeneracy of the genetic code. The present invention also relates to 

10 subsequences of SEQ ID NO: 1 or SEQ ID NO: 49, wherein a subsequence 
of SEQ ID NO: 1 is a nucleic acid sequence encompassed by SEQ ID NO: 1 
or SEQ ID NO: 48 except that one or more nucleotides from the 5' 
and/or 3 1 end have been deleted. Preferably, a subsequence of SEQ ID 
NO: 1 or SEQ ID NO: 48 encodes a polypeptide fragment which has 

15 transcriptional activation activity. In a more preferred embodiment, 
a subsequence of SEQ ID NO: 1 or SEQ ID NO: 48 contains at least a 
nucleic acid sequence encoding the polypeptide sequence shown in SEQ 
ID NO: 3 . 

The techniques used to isolate or clone a nucleic acid sequence 

20 encoding a polypeptide are known in the art and include isolation from 
genomic DNA, preparation from cDNA, or a combination thereof. The 
cloning of the nucleic acid sequences of the present invention from 
such genomic DNA can be effected, e.g., by using methods based on 
polymerase chain reaction (PCR) or antibody screening of expression 

25 libraries to detect cloned DNA fragments with shared structural 
features. (See, e.g., Innis et al . , 1990, PCR: A Guide to Methods and 
Application, Academic Press, New York.) Other nucleic acid 

amplification procedures such as ligase chain reaction (LCR) , ligated 
activated transcription (LAT) and nucleic acid sequence-based 

30 amplification (NASBA) may be used. The nucleic acid sequence may be 
cloned from a microorganism, or another or related organism and thus, 
for example, may be an allelic or species variant of the polypeptide 
encoding region of the nucleic acid sequence. 

The transcriptional activators encoded by nucleic acid sequences 

35 which hybridize with an oligonucleotide probe which hybridizes with 
the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48, its 
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complementary strand, or allelic variants and subsequences of SEQ ID 
NO: 1 or SEQ ID NO: 48, or allelic variants and fragments of the 
transcriptional activators may be obtained from microorganisms of any 
genus . 

5 In a preferred embodiment, the transcriptional activators may be 

obtained from a fungal cell. "Fungi" as used herein includes the 
phyla Ascomycota, Basidiomycota, Chytridiomycota, and Zygomycota (as 
defined by Hawksworth, et al . , in Ainsworth and Bisby's Dictionary of 
The Fungi, 8th edition, 1995, CAB International, University Press, 
10 Cambridge, UK) as well as the Oomycota (as cited in Hawksworth et al . , 
1995, supra, page 171) and all mitosporic fungi (Hawksworth et al., 
1995, supra) . 

In preferred embodiment, the fungal source is a filamentous 
fungal strain. "Filamentous fungi" include all filamentous forms of 

15 the subdivision Eumycota and Oomycota (as defined by Hawksworth et 
al., 1995, supra). The filamentous fungi are characterized by a 
mycelia wall composed of chitin, cellulose, glucan, chitosan, mannan, 
and other complex polysaccharides. Vegetative growth is by hyphal 
elongation and carbon catabolism is obligately aerobic. Filamentous 

20 fungal strains include, but are not limited to, strains of Acremonium, 
Aspergillus, Aureobasidium, Cryptococcus , Filibasidium, Fusarium, 
Humicola, Magnaporthe, Mucor, Myceliophthora, Neocallimastix, 
Neurospora, Paecilomyces , Penicillium, Piromyces , Schizophyllum, 
Talaromyces, Thermoascus , Thielavia, Tolypocladium, and Trichoderma. 

25 In a more preferred embodiment, the nucleic acid sequence 

encoding a transcriptional activator of the present invention is 
obtained from a strain of Aspergillus, such as A. awamori or A. 
nidulans. Preferably, the nucleic acid sequence is obtained from a 
strain of A. niger or A. oryzae. Even more preferably, the nucleic 

30 acid sequence is obtained from an isolate of a strain of A. niger, DSM 
12298; e.g., the nucleic acid sequence set forth in SEQ ID NO: 1, or 
from A. oryzae IFO 4177, i.e., the nucleic acid sequence set forth in 
SEQ ID NO: 48. 

In another more preferred embodiment, the nucleic acid sequence 
35 encoding a transcriptional activator of the present invention is 
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obtained from a strain of Fusarium, such as F. oxysporum. Preferably, 
the strain is a strain of F. venenatum (Nirenberg sp. nov.) . 

In another preferred embodiment, the nucleic acid sequence 
encoding a transcriptional activator of the present invention is 
5 obtained from a yeast strain, such as a Candida, Kluyveromyces , 
Schizosaccharomyces, or Yarrowia strain. Preferably, the strain is a 
strain of Hansenula, Pichia, or Saccharomyces . 

It will be understood that for the aforementioned species, the 
invention encompasses both the perfect and imperfect states, and other 

10 taxonomic equivalents, e.g., anamorphs, regardless of the species name 
by which they are known. Those skilled in the art will readily 
recognize the identity of appropriate equivalents. For example, the 
polypeptides may be obtained from microorganisms which are taxonomic 
equivalents of Aspergillus as defined by Raper, K.D. and Fennel, D.I. 

15 (1965. The Genus Aspergillus, The Wilkins Company, Baltimore MD) , 
regardless of the species name by which they are known. Aspergilli 
are mitosporic fungi characterized by an aspergillum comprised of a 
conidiospore stipe with no known teleomorphic states terminating in a 
vesicle, which in turn bears one or two layers of synchronously formed 

20 specialized cells, variously referred to as sterigmata or phialides, 
and asexually formed spores referred to as conidia . "Known teleomorphs 
of Aspergillus include Eurotium, Neosartorya, and Emericella. Strains 
of Aspergillus and teleomorphs thereof are readily accessible to the 
public in a number of culture collections, such as the American Type 

25 Culture Collection (ATCC) , Deutsche Sammlung von Mikroorganismen und 
Zellkulturen GmbH (DSM) , Centraalbureau Voor Schimmelcultures (CBS) , 
and Agricultural Research Service Patent Culture Collection, Northern 
Regional Research Center (NRRL) . 

Furthermore, such transcriptional activators may be identified 

30 and obtained from other sources including microorganisms isolated from 
nature (e.g., soil, composts, water, etc.) using the above-mentioned 
probes. Techniques for isolating microorganisms from natural habitats 
are well known in the art. The nucleic acid sequence may then be 
derived by similarly screening a genomic or cDNA library of another 

35 microorganism. Once a nucleic acid sequence encoding a 

transcriptional activator has been detected with the probe (s), the 



sequence may be isolated or cloned by utilizing techniques which are 
known to those of ordinary skill in the art (see, e.g., J. Sambrook, 
E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory 
Manual, 2d edition, Cold Spring Harbor, New York) . 
5 In another preferred embodiment, the isolated nucleic acid 

sequence encodes a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 2 or SEQ ID NO: 49, or a fragment thereof, which has 
transcriptional activation activity. 

In another preferred embodiment, the isolated nucleic acid 
10 sequence encodes a polypeptide comprising the amino acid sequence of 
SEQ ID NO: 3. 

The present invention also relates to isolated nucleic acid 
sequences encoding a transcriptional activator of the present 
invention, which, e.g., using methods of standard Southern blotting 

15 procedures described above (cf., Sambrook, et al . , 1989, supra), 
hybridize under low stringency conditions, more preferably medium 
stringency conditions, and most preferably high stringency conditions, 
with an oligonucleotide probe which hybridizes under the same 
conditions with the nucleic acid sequence set forth in SEQ ID NO: 1 or 

20 SEQ ID NO: 48 or its complementary strand, or allelic variants and 
subsequences of SEQ ID NO: 1 or SEQ ID NO: 48 which encode polypeptide 
fragments which are transcriptional activators in fungi. 

In another more preferred embodiment, the nucleic acid sequence 
is the nucleic acid sequence encoding a polypeptide, which has DNA 

25 binding activity contained in the plasmid pEES which is contained in 
Escherichia coli DSM 12294. 

Nucleic Acid Constructs 

Another aspect of the present invention relates to nucleic acid 
30 constructs comprising a nucleic acid sequence encoding a 
transcriptional activator of the present invention operably linked to 
one or more control sequences, which direct the production of the 
transcriptional activator in a suitable expression host. In a 
preferred embodiment, the nucleic acid sequence encodes a polypeptide, 
35 which is contained in the plasmid pEES harboured in Escherichia coli 
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DSM 122 94, or the nucleic acid sequence shown in SEQ ID NO: 48 
encoding the polypeptide shown in SEQ ID NO: 49. 

Expression will be understood to include any step involved in the 
production of the polypeptide including, but not limited to, 
5 transcription, post-transcriptional modification, translation, post- 
translational modification, and secretion. Manipulation of the 
nucleic acid sequence encoding a polypeptide prior to its insertion 
into a vector may be desirable or necessary depending on the 
expression vector. The techniques for modifying nucleic acid 

10 sequences utilizing cloning methods are well known in the art. 

"Nucleic acid construct" is defined herein as a nucleic acid 
molecule, either single- or double -stranded, which is isolated from a 
naturally occurring gene or which has been modified to contain 
segments of nucleic acid which are combined and juxtaposed in a manner 

15 which would not otherwise exist in nature. The term nucleic acid 
construct is synonymous with the term expression cassette when the 
nucleic acid construct contains all the control sequences required for 
expression of a coding sequence. The term "coding sequence" as 
defined herein is a sequence, which is transcribed into mRNA and 

20 translated into a transcriptional activator of the present invention. 
The boundaries of the coding sequence are generally determined by the 
ATG start codon at the 5' end of the mRNA and a transcription 
terminator sequence located just downstream of the open reading frame 
at the 3' end of the mRNA. A coding sequence can include, but is not 

25 limited to, DNA, cDNA, and recombinant nucleic acid sequences. 

The term "control sequences" is defined herein to include all 
components, which are necessary or advantageous for the expression of 
a polypeptide. Each control sequence may be native or foreign to the 
nucleic acid sequence encoding the polypeptide. Such control 

30 sequences include, but are not limited to, a leader, a polyadenylation 
sequence, a propeptide sequence, a promoter, a signal sequence, and a 
transcription terminator. At a minimum, the control sequences include 
a promoter, and transcriptional and translational stop signals. The 
control sequences may be provided with linkers for the purpose of 

35 introducing specific restriction sites facilitating ligation of the 
control sequences with the coding region of the nucleic acid sequence 
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encoding a polypeptide. The term "operably linked" is defined herein 
as a configuration in which a control sequence is appropriately placed 
at a position relative to the coding sequence of the DNA sequence such 
that the control sequence directs the production of a polypeptide. 
5 The control sequence may be an appropriate promoter sequence, a 

nucleic acid sequence, which is recognized by a host cell for 
expression of the nucleic acid sequence. The promoter sequence 
contains transcriptional control sequences, which mediate the 
expression of the polypeptide. The promoter may be any nucleic acid 

10 sequence, which shows transcriptional activity in the cell including 
mutant, truncated, and hybrid promoters, and may be obtained from 
genes encoding extracellular or intracellular polypeptides either 
homologous or heterologous to the cell. 

The control sequence may also be a suitable transcription 

15 terminator sequence, a sequence recognized by a filamentous fungal 
cell to terminate transcription. The terminator sequence is operably 
linked to the 3' terminus of the nucleic acid sequence encoding the 
polypeptide. Any terminator, which is functional in the cell, may be 
used in the present invention. 

20 Preferred terminators for filamentous fungal cells are obtained 

from the genes encoding Aspergillus oryzae TAKA amylase, Aspergillus 
niger glucoamylase, Aspergillus nidulans anthranilate synthase, 
Aspergillus niger alpha-glucosidase, and Fusarium oxysporum trypsin- 
like protease. 

25 The control sequence may also be a suitable leader sequence, a 

nontranslated region of a mRNA which is important for translation by 
the filamentous fungal cell. The leader sequence is operably linked 
to the 5' terminus of the nucleic acid sequence encoding the 
polypeptide. Any leader sequence, which is functional in the cell, 

30 may be used in the present invention. 

Preferred leaders for filamentous fungal cells are obtained from 
the genes encoding Aspergillus oryzae TAKA amylase and Aspergillus 
nidulans triose phosphate isomerase. 

The control sequence may also be a polyadenylation sequence, a 

35 sequence which is operably linked to the 3' terminus of the nucleic 
acid sequence and which, when transcribed, is recognized by the 
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filamentous fungal cell as a signal to add polyadenosine residues to 
transcribed mRNA. Any polyadenylation sequence, which is functional 
in the cell, may be used in the present invention. 

Preferred polyadenylation sequences for filamentous fungal cells 
are obtained from the genes encoding Aspergillus oryzae TAKA amylase, 
Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate 
synthase, and Aspergillus niger alpha-glucosidase . 

The control sequence may also be a signal peptide-coding region, 
which codes for an amino acid sequence linked to the amino terminus of 
the polypeptide, which can direct the encoded polypeptide into the 
cell's secretory pathway. The 5' end of the coding sequence of the 
nucleic acid sequence may inherently contain a signal peptide-coding 
region naturally linked in translation reading frame with the segment 
of the coding region, which encodes the secreted polypeptide. 
Alternatively, the 5' end of the coding sequence may contain a signal 
peptide-coding region, which is foreign to the coding sequence. The 
foreign signal peptide-coding region may be required where the coding 
sequence does not normally contain a signal peptide-coding region. 
Alternatively, the foreign signal peptide-coding region may simply 
replace the natural signal peptide-coding region in order to obtain 
enhanced secretion of the polypeptide. The signal peptide-coding 
region may be obtained from a glucoamylase or an amylase gene from an 
Aspergillus species, or a lipase or proteinase gene from a Rhizomucor 
species. However, any signal peptide-coding region, which directs the 
expressed polypeptide into the secretory pathway of a filamentous 
fungal cell, may be used in the present invention. 

An effective signal peptide coding region for filamentous fungal 
cells is the signal peptide coding region obtained from the 
Aspergillus oryzae TAKA amylase gene, Aspergillus niger neutral 
amylase gene, Rhizomucor miehei aspartic proteinase gene, or Humicola 
lanuginosa cellulase gene. 

The control sequence may also be a propeptide coding region, 
which codes for an amino acid sequence positioned at the amino 
terminus of a polypeptide. The resultant polypeptide is known as a 
proenzyme or propolypeptide (or a zymogen in some cases) . A 
propolypeptide is generally inactive and can be converted to a mature 
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active polypeptide by catalytic or autocatalytic cleavage of the 
propeptide from the propolypeptide . The propeptide coding region may 
be obtained from the Rhizomucor miehei aspartic proteinase gene, or 
the Myceliophthora thermophila laccase gene (WO 95/33836) . 
5 Where both signal peptide and propeptide regions are present at 

the amino terminus of a polypeptide, the propeptide region is 
positioned next to the amino terminus of a polypeptide and the signal 
peptide region is positioned next to the amino terminus of the 
propeptide region. 

10 

Expression Vectors 

The present invention also relates to recombinant expression 
vectors comprising a nucleic acid sequence of the present invention, a 
promoter, and transcriptional and translational stop signals. The 

15 various nucleic acid and control sequences described above may be 
joined together to produce a recombinant expression vector which may 
include one or more convenient restriction sites to allow for 
insertion or substitution of the nucleic acid sequence encoding the 
polypeptide at such sites. Alternatively, the nucleic acid sequence 

20 encoding the polypeptide may be expressed by inserting the sequence or 
a nucleic acid construct comprising the sequence into an appropriate 
vector for expression. In creating the expression vector, the coding 
sequence is located in the vector so that the coding sequence is 
operably linked with the appropriate control sequences for expression, 

25 and possibly secretion. 

The recombinant expression vector may be any vector (e.g., a 
plasmid or virus) , which can be conveniently subjected to recombinant 
DNA procedures and can bring about the expression of the nucleic acid 
sequence encoding the polypeptide. The choice of the vector will 

30 typically depend on the compatibility of the vector with the 
filamentous fungal cell into which the vector is to be introduced. 
The vectors may be linear or closed circular plasmids. The vector may 
be an autonomously replicating vector, i.e., a vector, which exists as 
an extrachromosomal entity, the replication of which is independent of 

35 chromosomal replication, e.g., a plasmid, an extrachromosomal element, 
a mini chromosome, or an artificial chromosome. The vector may contain 



any means for assuring self -replication. Alternatively, the vector 
may be one which, when introduced into the filamentous fungal cell, is 
integrated into the genome and replicated together with the 
chromosome (s) into which it has been integrated. The vector system 
5 may be a single vector or plasmid or two or more vectors or plasmids, 
which together contain the total DNA to be introduced into the genome 
of the filamentous fungal cell, or a transposon. 

The vectors preferably contain one or more selectable markers, 
which permit easy selection of transformed cells. A selectable marker 

10 is a gene the product of which provides for biocide or viral 
resistance, resistance to heavy metals, prototrophy to auxotrophs, and 
the like. A selectable marker for use in a filamentous fungal cell 
may be selected from the group including, but not limited to, amdS 
(acetamidase) , argB (ornithine carbamoyl trans f erase) , bar 

15 (phosphinothricin acetyltransf erase) , hygB (hygromycin 

phosphotransferase), niaD (nitrate reductase), pyrG (orotidine-5 ' - 
phosphate decarboxylase) , sC (sulfate adenyltransf erase) , and trpC 
(anthranilate synthase), as well as equivalents from other species. 
Preferred for use in an Aspergillus cell are the amdS and pyrG genes 

20 of Aspergillus nidulans or Aspergillus oryzae and the bar gene of 
Streptojnyces hygroscopicus . 

The vectors preferably contain an element (s) that permits stable 
integration of the vector into the host cell genome or autonomous 
replication of the vector in the cell independent of the genome of the 

25 cell. 

Host Cells 

Another aspect of the present invention relates to host cells 
comprising a nucleic acid construct or an expression vector of the 
30 present invention. 

The choice of a host cell in the methods of the present invention 
will to a large extent depend upon the source of the nucleic acid 
sequence encoding the polypeptide of interest. 

The introduction of an expression vector or a nucleic acid 
35 construct into a filamentous fungal cell may involve a process 
consisting of protoplast formation, transformation of the protoplasts, 



and regeneration of the cell wall in a manner known per se. Suitable 
procedures for transformation of Aspergillus cells are described in EP 
238 023 and Yelton et al . , 1984, Proceedings of the National Academy 
of Sciences USA 81: 1470-1474. A suitable method of transforming 
5 Fusarium species is described by Malardier et al . , 1989, Gene 78: 147- 
156 or in WO 96/00787. 

"Introduction" means introducing a vector comprising the nucleic 
acid sequence encoding the polypeptide into a filamentous fungal cell 
so that the vector is maintained as a chromosomal integrant or as a 

10 self -replicating extra -chromosomal vector. Integration is generally 
considered to be an advantage as the nucleic acid sequence is more 
likely to be stably maintained in the cell. Integration of the vector 
into the chromosome occurs by homologous recombination, non- homologous 
recombination, or transposition. 

15 For integration into the host cell genome, the vector may rely on 

the nucleic acid sequence encoding the polypeptide or any other 
element of the vector for stable integration of the vector into the 
genome by homologous or nonhomologous recombination. Alternatively, 
the vector may contain additional nucleic acid sequences for directing 

20 integration by homologous recombination into the genome of the host 
cell. The additional nucleic acid sequences enable the vector to be 
integrated into the host cell genome at a precise location (s) in the 
chromosome (s) . To increase the likelihood of integration at a precise 
location, the integrational elements should preferably contain a 

25 sufficient number of nucleic acids, such as 100 to 1,500 base pairs, 
preferably 400 to 1,500 base pairs, and most preferably 800 to 1,500 
base pairs, which are highly homologous with the corresponding target 
sequence to enhance the probability of homologous recombination. The 
integrational elements may be any sequence that is homologous with the 

30 target sequence in the genome of the host cell. These nucleic acid 
sequences may be any sequence that is homologous with a target 
sequence in the genome of the host cell, and, furthermore, may be non- 
encoding or encoding sequences . 

For autonomous replication, the vector may further comprise an 

35 origin of replication enabling the vector to replicate autonomously in 
the host cell. 
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The procedures used to ligate the elements described above to 
construct the recombinant expression vectors are well known to one 
skilled in the art (see, e.g., Sambrook, et al., supra). 

In a preferred embodiment, the filamentous fungal host cell is a 
5 cell of a species of, but not limited to, Acremonium, Aspergillus, 
Fusarium, Humicola, Mucor, Myceliophthora, Neurospora, Penicillium, 
Thielavia, Tolypocladium, or Trichoderma. 

In a more preferred embodiment, the filamentous fungal cell is an 
Aspergillus cell. In another more preferred embodiment, the 

10 filamentous fungal cell is an Acremonium cell. In another more 
preferred embodiment, the filamentous fungal cell is a Fusarium cell. 
In another more preferred embodiment, the filamentous fungal cell is a 
Humicola cell. In another more preferred embodiment, the filamentous 
fungal cell is a Mucor cell. In another more preferred embodiment, 

15 the filamentous fungal cell is a Myceliophthora cell. In another more 
preferred embodiment, the filamentous fungal cell is a Neurospora 
cell. In another more preferred embodiment, the filamentous fungal 
cell is a Penicillium cell. In another more preferred embodiment, the 
filamentous fungal cell is a Thielavia cell. In another more 

20 preferred embodiment, the filamentous fungal cell is a Tolypocladium 
cell. In another more preferred embodiment, the filamentous fungal 
cell is a Trichoderma cell. 

In a most preferred embodiment, the filamentous fungal cell is an 
Aspergillus awamori, Aspergillus foetidus, Aspergillus japonicus , 

25 Aspergillus nidulans, Aspergillus niger or Aspergillus oryzae cell. 
In another most preferred embodiment, the filamentous fungal cell is a 
Fusarium bactridioides , Fusarium cerealis , Fusarium crookwellense , 
Fusarium culmorum, Fusarium graminearum, Fusarium graminum, Fusarium 
heterosporum, Fusarium negundi, Fusarium oxysporum, Fusarium 

30 reticulatum, Fusarium roseum, Fusarium sambucinum, Fusarium 
sarcochroum, Fusarium sulphureum, Fusarium toruloseum, Fusarium 
trichothecioides , or Fusarium venenatum cell. In an even most 
preferred embodiment, the filamentous fungal cell is a Fusarium 
venenatum (Nirenberg sp. nov.). In another most preferred embodiment, 

35 the filamentous fungal cell is a Humicola insolens or Humicola 
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lanuginosa cell. In another most preferred embodiment, the 

filamentous fungal cell is a Mucor miehei cell. In another most 
preferred embodiment, the filamentous fungal cell is a Myceliophthora 
thermophilic cell. In another most preferred embodiment, the 
5 filamentous fungal cell is a Neurospora crassa cell. In another most 
preferred embodiment, the filamentous fungal cell is a Penicillium 
purpurogenum cell. In another most preferred embodiment, the 
filamentous fungal cell is a Thielavia terrestris cell. In another 
most preferred embodiment, the Trichoderma cell is a Trichoderma 
10 harzianum, Trichoderma koningii , Trichoderma longibrachiatum, 
Trichoderma reesei or Trichoderma viride cell. 

Polypeptides having Transcriptional Activation Activity 

Another aspect of the present invention relates to an isolated 
15 polypeptide selected from the group consisting of: 

(a) a polypeptide which is encoded in a nucleic acid sequence 
which hybridizes under low stringency conditions with (i) the nucleic 
acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48; (ii) its complementary 
strand, or (iii) a subsequence of SEQ ID NO: 1 or SEQ ID NO: 48 which 

20 encodes a polypeptide fragment which has transcriptional activation 
activity, wherein the low stringency conditions are defined by 
prehybridization and hybridization at 42°C in 5X SSPE, 0.3% SDS, 200 
micro g/ml sheared and denatured salmon sperm DNA, and 25% formamide, 
and wash conditions are defined at 50 °C for 30 minutes in 2X SSC, 0.2% 

25 SDS ; 

(b) a polypeptide having an amino acid sequence which has at 
least 50% identity with the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO: 49; 

(c) an allelic variant of (a) or (b) ; 

30 (d) a fragment of (a) , (b) , or (c) , wherein the fragment has 

transcriptional activation activity; and 

(e) a polypeptide comprising the amino acid sequence of SEQ ID 
NO: 3, or an allelic variant thereof. 

The transcriptional activator may be isolated using techniques as 
35 described herein. As defined herein, an "isolated" polypeptide is a 
polypeptide which is essentially free of other polypeptides, e.g., at 



least about 20% pure, preferably at least about 40% pure, more 
preferably about 60% pure, even more preferably about 80% pure, most 
preferably about 90% pure, and even most preferably about 95% pure, as 
determined by SDS-PAGE. 
5 The present invention also relates to isolated polypeptides 

having an amino acid sequence which has a degree of identity to the 
amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49 of at least about 
50%, preferably at least about 55%, preferably at least about 60%, 
preferably at least about 65%, preferably at least about 70%, 

10 preferably at least about 75%, preferably at least about 8 0%, more 
preferably at least about 85%, even more preferably at least about 
90%, most preferably at least about 95%, and even most preferably at 
least about 97%, even more preferred at least 99%, which have 
transcriptional activation activity. 

15 In more preferred embodiment, the transcriptional activator of 

the present invention comprises the amino acid sequence of SEQ ID NO: 
2 or SEQ ID NO: 4 9 or a fragment thereof, wherein the fragment retains 
transcriptional activation activity. In a most preferred embodiment, 
the polypeptide has the amino acid sequence of SEQ ID NO: 2 or SEQ ID 

20 NO: 49. A fragment of SEQ ID NO: 2 or SEQID NO: 49 is a polypeptide 
having one or more amino acids deleted from the amino and/or carboxy 
terminus of this amino acid sequence. Preferably, a fragment of SEQ 
ID NO: 2 or SEQ ID NO: 49 contains at least the polypeptide sequence 
shown in SEQ ID NO: 3. 

25 The amino acid sequences of the homologous polypeptides may 

differ from the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 49 
or SEQ ID NO: 3 by an insertion or deletion of one or more amino acid 
residues and/or the substitution of one or more amino acid residues by 
different amino acid residues. Preferably, amino acid changes are of 

30 a minor nature, that is conservative amino acid substitutions that do 
not significantly affect the folding and/or activity of the protein; 
small deletions, typically of one to about 30 amino acids; small 
amino- or carboxyl -terminal extensions, such as an amino- terminal 
methionine residue; a small linker peptide of up to about 20-25 

35 residues; or a small extension that facilitates purification by 
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changing net charge or another function, such as a poly-histidine 
tract, an antigenic epitope or a binding domain. 

Examples of conservative substitutions are within the group of 
basic amino acids (such as arginine, lysine and histidine) , acidic 
5 amino acids (such as glutamic acid and aspartic acid) , polar amino 
acids (such as glutamine and asparagine) , hydrophobic amino acids 
(such as leucine, isoleucine and valine) , aromatic amino acids (such 
as phenylalanine, tryptophan and tyrosine) , and small amino acids 
(such as glycine, alanine, serine, threonine and methionine) . Amino 

10 acid substitutions, which do not generally alter the specific activity 
are known in the art and are described, for example, by H. Neurath and 
R.L. Hill (1979. The Proteins, Academic Press, New York) . The most 
commonly occurring exchanges are Ala/Ser, Val/Ile, Asp/Glu, Thr/Ser, 
Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, Ala/Pro, 

15 Lys/Arg, Asp/Asn, Leu/Ile, Leu/Val, Ala/Glu, and Asp/Gly as well as 
these in reverse. 

In a more preferred embodiment, a transcriptional activator of 
the present invention is obtained from an Aspergillus niger strain, 
more preferably from Aspergillus niger AB4 . 1 (van Haringsveldt , W. , et 

20 al., 1987. Mol. Gen. Genet. 206:71-75), and most preferably from 
Aspergillus niger 13PAP2, which has been deposited at DSM as DSM 
12298, or a mutant strain thereof, harbouring, e.g., the polypeptide 
with the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4 9 or SEQ 
ID NO: 3. 

25 In another preferred embodiment, the transcriptional activator of 

the present invention is the polypeptide encoded in the nucleic acid 
sequence contained in plasmid pEES, which is contained in Escherichia 
coli DSM 12294 or the nucleic acid sequence shown in SEQ ID NO: 48 
encoding the polypeptide shown in SEQ ID NO: 49. 

30 The present invention further relates to methods for producing 

the transcriptional activator of the present invention comprising (a) 
cultivating a host cell harbouring a nucleic acid construct or an 
expression vector comprising a nucleic acid sequence encoding the 
transcriptional activator of the invention under conditions conducive 

35 for production of the polypeptide; and (b) recovering the polypeptide. 
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Host Cells having Altered Transcriptional Activation Activity 

Another aspect of the present invention relates to a host cell 
which is a mutant of a parent fungal cell useful for the production of 
a polypeptide in which the parent cell comprises one or more nucleic 
5 acid sequences encoding a protease, the transcription of which is 
activated by a transcriptional activator of the present invention, and 
the mutant cell produces less of the transcriptional activator and the 
protease (s) than the parent cell when cultured under the same 
conditions . 

10 The mutant cell may be constructed using methods well known in the 

art; for example, by one or more nucleotide insertions or deletions of 
the gene encoding the transcriptional activator. 

In a preferred embodiment the mutant cell is obtained by 
modification or inactivation of a nucleic acid sequence present in the 

15 cell and necessary for expression of the transcriptional activator. 

In a more preferred embodiment, the nucleic acid sequence is 
selected from the group consisting of: (a) a nucleic acid sequence 
having at least 70% identity with the nucleic acid sequence of SEQ ID 
NO: 1 or SEQ ID NO: 48; (b) a nucleic acid sequence encoding a 

20 polypeptide having an amino acid sequence which has at least 50% 
identity with the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 
49; (c) a nucleic acid sequence which hybridizes under low stringency 
conditions with (i) the nucleic acid sequence of SEQ ID NO: 1 or SEQ 
ID NO: 48, or (ii) its complementary strand, (d) an allelic variant of 

25 (a) , (b) , or (c) ; (e) a subsequence of (a) , (b) , (c) , or (d) , wherein 
the subsequence encodes a polypeptide fragment which has 
transcriptional activation activity; and (f ) a subsequence of (a) , (b) 
(c) , or (d) , wherein the subsequence encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 3. 

30 In another preferred embodiment the reduced expression of the 

transcriptional activator in the mutant cell is obtained by modification 
or inactivation of a control sequence required for the expression of the 
transcriptional activator. The term "control sequence" is defined, 
supra, in the section entitled "Nucleic Acid Constructs." In a more 

35 preferred embodiment the control sequence in the mutant cell is a 
promoter sequence or a functional part thereof, i.e., a part, which is 
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sufficient for affecting expression of the nucleic acid sequence. 
Other control sequences for possible modification include, but are not 
limited to, a leader, a polyadenylation sequence, a propeptide 
sequence, a signal sequence, and a transcription terminator. 
5 Modification or inactivation of the gene may be performed by 

subjecting the parent cell to mutagenesis and selecting for mutant 
cells in which the capability to produce a transcriptional activator 
has been reduced. The mutagenesis, which may be specific or random, 
may be performed, for example, by use of a suitable physical or 

10 chemical mutagenizing agent, by use of a suitable oligonucleotide, or 
by subjecting the DNA sequence to PCR generated mutagenesis. 
Furthermore, the mutagenesis may be performed by use of any 
combination of these mutagenizing agents. 

Examples of a physical or chemical mutagenizing agent suitable 

15 for the present purpose include ultraviolet (UV) irradiation, 
hydroxy 1 amine, N-methyl-N' -nitro-N-nitrosoguanidine (MNNG) , 0-methyl 
hydroxy 1 amine, nitrous acid, ethyl methane sulphonate (EMS) , sodium 
bisulphite, formic acid, and nucleotide analogues. 

When such agents are used, the mutagenesis is typically performed 

20 by incubating the parent cell to be mutagenized in the presence of the 
mutagenizing agent of choice under suitable conditions, and selecting 
for mutant cells exhibiting reduced expression of the gene. 

Modification or inactivation of the gene may be accomplished by 
introduction, substitution, or removal of one or more nucleotides in 

25 the gene's nucleic acid sequence or a regulatory element required for 
the transcription or translation thereof. For example, nucleotides 
may be inserted or removed so as to result in the introduction of a 
stop codon, the removal of the start codon, or a change of the open 
reading frame. Such modification or inactivation may be accomplished 

30 by site-directed mutagenesis or PCR generated mutagenesis in 
accordance with methods known in the art. Although, in principle, the 
modification may be performed in vivo, i.e., directly on the fungal 
cell expressing the gene to be modified, it is preferred that the 
modification be performed in vitro as exemplified below. 

35 An example of a convenient way to inactivate or reduce expression 

of the gene by a fungal cell of choice is based on techniques of gene 
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replacement or gene interruption. For example, in the gene 

interruption method, a nucleic acid sequence corresponding to the 
endogenous gene or gene fragment of interest is mutagenized in vitro 
to produce a defective nucleic acid sequence which is then transformed 
5 into the parent cell to produce a defective gene. By homologous 
recombination, the defective nucleic acid sequence replaces the 
endogenous gene or gene fragment. It may be desirable that the 
defective gene or gene fragment also encodes a marker, which may be 
used for selection of transf ormants in which the nucleic acid sequence 

10 has been modified or destroyed. 

Alternatively, modification or inactivation of the gene may be 
performed by established anti -sense techniques using a nucleotide 
sequence complementary to the nucleic acid sequence of the gene. More 
specifically, expression of the gene by a filamentous fungal cell may 

15 be reduced or eliminated by introducing a nucleotide sequence 
complementary to the nucleic acid sequence, which may be transcribed 
in the cell and is capable of hybridizing to the mRNA produced in the 
cell. Under conditions allowing the complementary anti-sense 

nucleotide sequence to hybridize to the mRNA, the amount of protein 

20 translated is thus reduced or eliminated. 

A nucleic acid sequence complementary to the nucleic acid 
sequence of SEQ ID NO: 1 or SEQ ID NO: 48 may be obtained from any 
microbial source. The preferred sources are fungal sources, e.g., 
yeast and filamentous fungi as described supra. Preferred filamentous 

25 fungal sources include, but are not limited to, species of Acremonium, 
Aspergillus, Fusarium, Humicola, Myceliophthora, Mucor, Neurospora, 
Penicillium, Phanerochaete, Thielavia, Tolypocladium, and Trichoderma. 
Preferred yeast sources include, but are not limited to, species of 
Candida, Hansenula, Kluyveromyces , Pichia, Saccharomyces , 

30 Schizosaccharomyces , and Yarrowia. Furthermore, the nucleic acid 
sequence may be native to the filamentous fungal cell. 

In another preferred embodiment, the parent cell harbours a gene 
having a nucleic acid sequence encoding a polypeptide with an amino 
acid sequence which has at least 50% identity with the amino acid 

35 sequence of SEQ ID NO: 2 or SEQID NO: 49. 
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In another preferred embodiment, the parent cell harbours a gene 
having a nucleic acid sequence with at least 70% identity with the 
nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO: 48. 

In another preferred embodiment, the mutant cell harbours a 
5 nucleic acid sequence, which has been modified or inactivated by any of 
the methods described above and produces less of a protease or a 
combination of proteases than the parent cell when cultured under 
identical conditions. The mutant cell produces preferably at least 
about 25% less, more preferably at least about 50% less, even more 

10 preferably at least about 75% less, and even more preferably at least 
about 95% less of a protease or a combination of proteases than the 
parent cell when cultured under identical conditions. 

In an even more preferred embodiment, the mutant cell produces 
essentially undetectable amounts of a protease or combination of 

15 proteases than the parent cell when cultured under identical conditions. 

The protease (s) may be assayed using known methods. In one such 
method, an aliquot of a 48 hour culture media is incubated with 3 H- 
labelled sperm whale myoglobin at pH 4.0 and the radioactivity in the 
TCA-soluble fraction is measured (van Noort, J.M., et al . , 1991. Anal. 

20 Biochem 198:385-390). Other methods have been described for 

identifying, e.g., aspartic proteinase A. of A. niger (Takahashi, K., 
1991. Meth. in Enzymol . 248:146-155), endopeptidases (Morihara, K., 
1995. Meth. in Enzymol. 248:242-253), carboxypeptidases (Reminton, J., 
and Breddam, K. , 1994. Meth. in Enzymol. 244:231-248), dipeptidyl 

25 peptidase (Ikehara, Y., et al . , 244:215-227), and aminopeptidases 
(Little, G., et al . , 1976. Meth. in Enzymol. 45:495-503). 

In another preferred embodiment, the mutant cell harbours at least 
one copy of a nucleic acid sequence encoding a polypeptide of interest. 

Another aspect of the present invention relates to a host cell 

30 useful for the production of a polypeptide wherein the host cell is a 
mutant of a parent fungal cell in which the mutant (a) produces more 
of the transcriptional activator of the present invention as compared 
to the parent cell when cultured under the same conditions; and (b) 
comprises a DNA sequence encoding the polypeptide, the transcription 

35 of which is activated by the transcriptional activator. 
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In a preferred embodiment, the host cell produces more of the 
transcriptional activator than the parent cell when cultured under the 
same conditions by introducing into the parent cell one or more copies 
of (i) a nucleic acid sequence encoding a transcriptional activator, 
(ii) a nucleic acid construct comprising a nucleic acid sequence 
encoding a transcriptional activator, or (iii) an expression vector as 
defined above in the section "Expression Vectors" . 

The nucleic acid construct comprising a nucleic acid sequence 
encoding a transcriptional activator of the present invention may also 
comprise one or more nucleic acid sequences which encode one or more 
factors that are advantageous for directing the expression of the 
polypeptide, e.g., a transcriptional activator (e.gr., a trans-acting 
factor), a chaperone, and a processing protease. Any factor that is 
functional in the filamentous fungal cell of choice may be used in the 
present invention. The nucleic acids encoding one or more of these 
factors are not necessarily in tandem with the nucleic acid sequence 
encoding the polypeptide. 

An activator is a protein, which activates transcription of a 
nucleic acid sequence encoding a polypeptide (Kudla et al . , 1990, EMBO 
Journal 9: 1355-1364; Jarai and Buxton, 1994, Current Genetics 26: 
2238-244; Verdier, 1990, yeast 6: 271-297). The nucleic acid sequence 
encoding an activator may be obtained from the genes encoding 
Saccharomyces cerevisiae heme activator protein 1 (hapl) , 
Sac char omyces cerevisiae galactose metabolizing protein 4 {gal4) , 
Aspergillus nidulans ammonia regulation protein (areA) , and 
Aspergillus oryzae alpha-amylase activator (amyR) . For further 
examples, see Verdier, 1990, supra and MacKenzie et al., 1993, Journal 
of General Microbiology 139: 2295-2307. 

A chaperone is a protein which assists another polypeptide in 
folding properly (Hartl et al., 1994, TIBS 19: 20-25; Bergeron et al . , 
1994, TIBS 19: 124-128; Demolder et al., 1994, Journal of 
Biotechnology 32: 179-189; Craig, 1993, Science 260: 1902-1903; 
Gething and Sambrook, 1992, Nature 355: 33-45; Puig and Gilbert, 1994, 
Journal of Biological Chemistry 269: 7764-7771; Wang and Tsou, 1993, 
The FASEB Journal 7: 1515-11157; Robinson et al., 1994, Bio/Technology 
1: 381-384; Jacobs et al . , 1993, Molecular Microbiology 8: 957-966). 
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The nucleic acid sequence encoding a chaperone may be obtained from 
the genes encoding Aspergillus oryzae protein disulphide isomerase or 
Saccharomyces cerevisiae calnexin, Saccharomyces cerevisiae BiP/GRP78 , 
and Saccharomyces cerevisiae Hsp70. For further examples, see Gething 
5 and Sambrook, 1992, supra, and Hartl et al . , 1994, supra. 

A processing protease is a protease that cleaves a propeptide to 
generate a mature biochemically active polypeptide (Enderlin and 
Ogrydziak, 1994, yeast 10: 67-79; Fuller et al . , 1989, Proceedings of 
the National Academy of Sciences USA 86: 1434-1438; Julius et al., 

10 1984, Cell 37: 1075-1089; Julius et al . , 1983, Cell 32: 839-852; U.S. 
Patent No. 5,702,934). The nucleic acid sequence encoding a 
processing protease may be obtained from the genes encoding 
Saccharomyces cerevisiae dipeptidylaminopeptidase, Saccharomyces 
cerevisiae Kex2, Yarrowia lipolytica dibasic processing endoprotease 

15 (xpr6) , and Fusarium oxysporum metalloprotease (p45 gene) . 

In a more preferred embodiment, the nucleic acid sequence 
encoding the transcriptional activator is operably linked to a 
promoter, or a functional part thereof, which is stronger than the 
corresponding promoter of the parent cell. In an even more preferred 

20 embodiment, the promoter, or a functional part thereof, mediates the 
expression of a gene encoding an extracellular protease, such as the 
Aspergillus oryzae alkaline protease, A. oryzae neutral 
metalloprotease, A. niger aspergillopepsin protease, Fusarium 
oxysporum trypsin- like protease or F. venenatum trypsin. 

25 The present invention also relates to a host cell useful for the 

production of a polypeptide wherein the host cell is a mutant of a 
parent fungal cell in which the mutant comprises 

a) a modification or inactivation of a transcriptional 
activator of the present invention, or a regulatory sequence thereof, 

30 and 

b) (i) an inducible promoter operably linked to a nucleic acid 
sequence encoding a transcriptional activator of the present 
invention, and (ii) a promoter sequence to which the transcriptional 
activator can bind, operably linked to a nucleic acid sequence 

35 encoding the polypeptide, wherein (i) and (ii) can be introduced 
simultaneously or sequentially. 



The inactive form of the transcriptional activator in (a) above 
is obtained by inactivation or modification of a nucleic acid sequence 
present in the cell and necessary for the expression of the native 
transcriptional activator according to any of the methods as disclosed 
5 supra. In a preferred embodiment the inactivation or modification is 
obtained by methods, which include, but are not limited to, one or 
more nucleotide insertions, deletions or substitutions, specific or 
random mutagenesis, gene replacement or gene interruption, and anti- 
sense techniques using a nucleotide sequence complementary to the 

10 nucleic acid sequence of the transcriptional activator. In another 
preferred embodiment, the inactive form of the native transcriptional 
activator is obtained by inactivation or modification of a control 
sequence required for the expression of the transcriptional activator. 

In another preferred embodiment, the nucleic acid sequence 

15 encoding the native transcriptional activator has the sequence set 
forth in SEQ ID NO: 1 or SEQ ID NO: 48. In another preferred 
embodiment, the transcriptional activator comprises the polypeptide 
having the amino acid sequence in SEQ ID NO: 3. 

The inducible promoter sequence in (b) above may be any promoter 

20 sequence, or a functional part thereof, wherein the transcription 
initiation activity of the promoter can be induced according to the 
fermentation conditions. Preferably, the induction is mediated by a 
carbon or nitrogen catabolite. In a preferred embodiment, the promoter 
is the amdS promoter of Aspergillus nidulans or A. oryzae, the niaD 

25 promoter of A. nidulans, A. oryzae or A. niger, the niiA promoter of 
Aspergillus species, the alkaline phosphatase promoter of Aspergillus 
sp., the acid phosphatase promoter of Aspergillus sp., or the alcA 
promoter of A. niger. 

In another preferred embodiment, the host cell further comprises 

30 a promoter sequence, wherein the promoter sequence can be activated by 
the transcriptional activator and is operably linked to the nucleic 
acid sequence encoding the polypeptide. 

The promoter sequence activated by the transcriptional activator 
of the present invention may be any promoter sequence, or a functional 

35 part thereof, selected from the group which includes but is not 
limited to promoters obtained from the genes encoding Aspergillus 
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oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, 
Aspergillus niger neutral alpha-amylase, Aspergillus niger acid stable 
alpha -amylase, Aspergillus niger or Aspergillus awamori glucoamylase 
(glaA) , Rhizomucor miehei lipase, Aspergillus oryzae alkaline 
5 protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus 
nidulans acetamidase, the NA2-tpi promoter (a hybrid of the promoters 
from the genes encoding Aspergillus niger neutral alpha-amylase and 
Aspergillus oryzae triose phosphate isomerase) , and mutant, truncated, 
and hybrid promoters thereof. Particularly preferred promoters for 

10 use in filamentous fungal cells are a promoter, or a functional part 
thereof, from a protease gene; e.g., from the Fusarium oxysporum 
trypsin-like protease gene (U.S. Patent No. 4,288,627), Aspegillus 
oryzae alkaline protease gene (alp) , Aspergillus niger pacA gene, 
Aspergillus oryzae alkaline protease gene, A. oryzae neutral 

15 metalloprotease gene, A. niger aspergillopepsin protease gene, or F. 
venenatum trypsin gene. 

In another preferred embodiment, the host cell harbours at least 
one copy of a nucleic acid sequence encoding a polypeptide. 

In another preferred embodiment, the host cell, which expresses 

20 the transcriptional activator of the present invention produces less 
of one or more native proteases than the parent cell when cultured 
under identical conditions. The protease (s) may be assayed using any 
of the methods described above. In a more preferred embodiment, an 
aliquot from a 48 -hour culture media is incubated with 3 H-labelled 

25 sperm whale myoglobin at pH 4.0 and the radioactivity in the TCA- 
soluble fraction is measured (van Noort, J.M., et al . , supra). 

The nucleic acid constructs described herein may be introduced 
into a parent fungal cell according to any of the methods as described 
supra in the section, "Host Cells" to obtain a host cell useful for 

30 the production of a polypeptide. In a preferred embodiment the 
nucleic acid construct is integrated into the chromosome of the cell. 
In another preferred embodiment the nucleic acid construct is 
maintained as a self -replicating extra -chromosomal vector. 

It will be understood that the methods of the present invention 

35 are not limited to a particular order for obtaining the mutant fungal 
cell. The modification of the second nucleic acid sequence may be 
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introduced into the parent cell at any step in the construction of the 
cell for the production of a polypeptide. 

Producing a Polypeptide 

5 Another aspect of the present invention relates to methods of 

producing a polypeptide in a host cell of the present invention, 
comprising: (a) cultivating the host cell which harbours a gene 
encoding the polypeptide in a nutrient medium suitable for production of 
the polypeptide; and (b) recovering the polypeptide from the nutrient 

10 medium of the host cell. 

In one embodiment, the host cell which is a mutant of a parent 
fungal cell in which the parent cell comprises one or more nucleic 
acid sequences encoding a protease, the transcription of which is 
activated by a transcriptional activator of the present invention, and 

15 the mutant cell produces less of the transcriptional activator and the 
protease (s) than the parent cell when cultured under the same 
conditions . 

In another embodiment, the host cell is a mutant of a parent 
fungal cell in which the mutant (a) produces more of the 

20 transcriptional activator of the present invention as compared to the 
parent cell when cultured under the same conditions; and (b) comprises 
a DNA sequence encoding the polypeptide, the transcription of which is 
activated by the transcriptional activator. 

In another embodiment, the host cell is a mutant of a parent 

25 fungal cell in which the mutant comprises (a) a modification or 
inactivation of a transcriptional activator of the present invention 
or a regulatory sequence thereof, and (b) an inducible promoter 
operably linked to a nucleic acid sequence encoding a transcriptional 
activator of the present invention and a promoter sequence to which 

30 the transcriptional activator can bind, operably linked to a nucleic 
acid sequence encoding the polypeptide, wherein (i) and (ii) can be 
introduced simultaneously or sequentially. 

The host cells of the present invention are cultivated in a 
nutrient medium suitable for production of the polypeptide of interest 

35 using methods known in the art. For example, the cells may be 
cultivated by shake flask cultivation, small-scale or large-scale 
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fermentation (including continuous, batch, fed-batch, or solid state 
fermentations) in laboratory or industrial fermentors performed in a 
suitable medium and under conditions allowing the polypeptide to be 
expressed and/or isolated. The cultivation takes place in a suitable 
5 nutrient medium comprising carbon and nitrogen sources and inorganic 
salts, using procedures known in the art (see, e.g., Bennett, J.W. and 
LaSure, L . , eds . , More Gene Manipulations in Fungi, Academic Press, 
CA, 1991) . Suitable media are available from commercial suppliers or 
may be prepared using published compositions (e.g., in catalogues of 

10 the American Type Culture Collection) . If the polypeptide is secreted 
into the nutrient medium, the polypeptide can be recovered directly 
from the medium. If the polypeptide is not secreted, it is recovered 
from cell lysates . 

The resulting polypeptide may be isolated by methods known in the 

15 art. For example, the polypeptide may be isolated from the nutrient 
medium by conventional procedures including, but not limited to, 
centrifugation, filtration, extraction, spray drying, evaporation, or 
precipitation. The isolated polypeptide may then be further purified 
by a variety of procedures known in the art including, but not limited 

20 to, chromatography (e.g., ion exchange, affinity, hydrophobic, 
chromatof ocusing, and size exclusion) , electrophoretic procedures 
(e.g., preparative isoelectric focusing, differential solubility 
(e.g., ammonium sulfate precipitation), or extraction (see, e.g., 
Protein Purification, J.-C. Janson and Lars Ryden, editors, VCH 

25 Publishers, New York, 1989) . 

The polypeptide may be detected using methods known in the art 
that are specific for the polypeptide. These detection methods may 
include use of specific antibodies, formation of an enzyme product, 
disappearance of an enzyme substrate, or SDS-PAGE. For example, an 

30 enzyme assay may be used to determine the activity of the polypeptide. 
Procedures for determining enzyme activity are known in the art for 
many enzymes . 

In the methods of the present invention, the host cell produces 
at least about 20% more, preferably at least about 50% more, more 
35 preferably at least about 100% more, even more preferably at least 
about 200% more, and most preferably at least about 300% more of the 



polypeptide than a corresponding parent cell when cultivated under the 
same conditions. 

The polypeptide may be any polypeptide whether native or 
heterologous to the mutant filamentous fungal cell. The term 
5 "heterologous polypeptide" is defined herein as a polypeptide, which 
is not produced by a cell. The term "polypeptide" is not meant herein 
to refer to a specific length of the encoded produce and therefore 
encompasses peptides, oligopeptides and proteins. The polypeptide may 
also be a recombinant polypeptide, which is a polypeptide native to a 

10 cell, which is encoded by a nucleic acid sequence, which comprises one 
or more control sequences foreign to the nucleic acid sequence, which 
are involved in the production of the polypeptide. The polypeptide 
may be a wild-type polypeptide or a variant thereof. The polypeptide 
may also be a hybrid polypeptide, which contains a combination of 

15 partial or complete polypeptide sequences obtained from at least two 
different polypeptides where one or more of the polypeptides may be 
heterologous to the cell. Polypeptides further include naturally 
occurring allelic and engineered variations of the above-mentioned 
polypeptides . 

20 In a preferred embodiment, the polypeptide is an antibody or 

portions thereof, an antigen, a clotting factor, an enzyme, a hormone 
or a hormone variant, a receptor or portions thereof, a regulatory 
protein, a structural protein, a reporter, or a transport protein. 

In a more preferred embodiment, the enzyme is an oxidoreductase, 

25 transferase, hydrolase, lyase, isomerase, or ligase. 

In an even more preferred embodiment, the enzyme is an 
aminopeptidase, amylase, carbohydrase, carboxypeptidase, catalase, 
cellulase, chitinase, cutinase, deoxyribonuclease, dextranase, 
esterase , alpha-galactosidase , beta-galactosidase , glucoamylase , 

30 alpha-glucosidase, beta-glucosidase, haloperoxidase, invertase, 
laccase, lipase, mannosidase, mutanase, oxidase, pectinolytic enzyme, 
peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, 
ribonuclease, transglutaminase, or xylanase . 

In another even more preferred embodiment, the polypeptide is 

35 human insulin or an analog thereof, human growth hormone, 
erythropoietin, or insulinotropin. 
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The nucleic acid sequence encoding a heterologous polypeptide may 
be obtained from any prokaryotic, eukaryotic, or other source. For 
purposes of the present invention, the term "obtained from" as used 
herein in connection with a given source shall mean that the 
5 polypeptide is produced by the source or by a cell in which a gene 
from the source has been inserted. 

In the methods of the present invention, the mutant filamentous 
fungal cells may also be used for the recombinant production of 
polypeptides, which are native to the cell. The native polypeptides 

10 may be recombinantly produced by, e.g., placing a gene encoding the 
polypeptide under the control of a different promoter to enhance 
expression of the polypeptide, to expedite export of a native 
polypeptide of interest outside the cell by use of a signal sequence, 
and to increase the copy number of a gene encoding the polypeptide 

15 normally produced by the cell. The present invention also 

encompasses, within the scope of the term "heterologous polypeptide", 
such recombinant production of polypeptides native to the cell, to the 
extent that such expression involves the use of genetic elements not 
native to the cell, or use of native elements which have been 

20 manipulated to function in a manner that do not normally occur in the 
filamentous fungal cell. The techniques used to isolate or clone a 
nucleic acid sequence encoding a heterologous polypeptide are known in 
the art and include isolation from genomic DNA, preparation from cDNA, 
or a combination thereof . The cloning of the nucleic acid sequences 

25 from such genomic DNA can be effected, e.g., by using the well known 
polymerase chain reaction (PCR) . See, for example, Innis et al., 
1990, PCR Protocols : A Guide to Methods and Application, Academic 
Press, New York. The cloning procedures may involve excision and 
isolation of a desired nucleic acid fragment comprising the nucleic 

30 acid sequence encoding the polypeptide, insertion of the fragment into 
a vector molecule, and incorporation of the recombinant vector into 
the mutant fungal cell where multiple copies or clones of the nucleic 
acid sequence will be replicated. The nucleic acid sequence may be of 
genomic, cDNA, RNA, semisynthetic, synthetic origin, or any 

35 combinations thereof. 
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In the methods of the present invention, heterologous 
polypeptides may also include fused or hybrid polypeptides in which 
another polypeptide is fused at the N-terminus or the C-terminus of 
the polypeptide or fragment thereof. A fused polypeptide is produced 
5 by fusing a nucleic acid sequence (or a portion thereof) encoding one 
polypeptide to a nucleic acid sequence (or a portion thereof) encoding 
another polypeptide. Techniques for producing fusion polypeptides are 
known in the art, and include, ligating the coding sequences encoding 
the polypeptides so that they are in frame and expression of the fused 

10 polypeptide is under control of the same promoter (s) and terminator. 
The hybrid polypeptides may comprise a combination of partial or 
complete polypeptide sequences obtained from at least two different 
polypeptides wherein one or more may be heterologous to the mutant 
fungal cell. An isolated nucleic acid sequence encoding a heterologous 

15 polypeptide of interest may be manipulated in a variety of ways to 
provide for expression of the polypeptide. Expression will be 
understood to include any step involved in the production of the 
polypeptide including, but not limited to, transcription, post- 
transcript ional modification, translation, post -translat ional 

20 modification, and secretion. Manipulation of the nucleic acid 
sequence encoding a polypeptide prior to its insertion into a vector 
may be desirable or necessary depending on the expression vector. The 
techniques for modifying nucleic acid sequences utilizing cloning 
methods are well known in the art. 

25 The present invention is further described by the following 

examples, which should not be construed as limiting the scope of the 
invention. 

EXAMPLES 
30 Materials 

Chemicals used as buffers and substrates were commercial products 
of at least reagent grade. 

Strains 

35 AB4.1: a strain of Aspergillus niger which is a cspAl pyrGl 

derivative of strain ATTC 9029 (van Hartingsveldt , W., et al . , 1987. 



35 



Mol. Gen. Genet. 206:71-75; Bos, C.J., et al . , Curr. Genet. 14:437- 
443) 

AB1.13: a protease deficient strain of Aspergillus niger 
derived from UV mutagenesis of AB4 . 1 (Mattern, I.E., et al . , 1992. 
5 Mol. Gen. Genet. 234:332-336) 

13PAP2: an AB1.13 derivative containing multiple copies of the 
A. nidulans amdS gene (Corrick, R. A., et al, 1987. Gene 53 : 63 - 71) 
under control of the pepA promoter of A. niger (Jarai G. and Buxton F. 
1994. Curr Genet 26:238-244). The strain has a protease deficient 
10 phenotype and is unable to grow on medium containing acetamide as the 
sole nitrogen source. Strain 13PAP2 has been deposited at DSM under 
the name DSM No. 12298. 

4PAP6: an AB4 . 1 derivative containing multiple copies of the 

of A. nidulans amdS gene under control of the pepA promoter of A. 
15 niger. The strain does not have a protease deficient phenotype and is 
able to grow on medium containing acetamide as the sole nitrogen 
source . 

N402:a strain of Aspergillus niger, deposited at the ATCC 
(Manassas VA, USA) as ATCC Number: 64 974 
20 MC1046: a strain of E. coli, deposited at the ATCC as ATCC 

Number: 35467 

A.oryzae IF04177: available from Institute for Fermentation, 
Osaka; 17-25 Juso Hammachi 2-Chome Yodogawa-Ku, Osaka, Japan. 
HowBlOl: described in WO 97/35956. 

25 

Plasmids 

pPAP: constructed as described below in Example 1 and shown in 
Figure 1 

pAopyrGcosArpl : constructed as described below in Example 1 and 
30 shown in Figure 2 

pEESl : constructed as described below in Example 1 and shown 

in Figure 3 

p3SR2: contains the A. nidulans amdS gene as described by 

CM. Corrick, A. P. Twomey, and M.J. Hynes (1987. Gene 53: 63-71) 
35 pABPYRG* -Not : contains an inactivated pyrG gene as described by 

Verdoes, J.C., et al . (1994. Gene 145: 179-187) 
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pHelpl: contains the pyrG gene from A. oryzae as a selective 
marker and the AMA1 sequences which enable autonomous replication in 
A. niger, cloned into the E. coli vector pIC20R, as described by Gems, 
D., et al. (1991. Gene 98: 61-67) 
5 pAnscosl: contains two cos sites as described by Osiewacz, H.D. 

(1994. Curr. Genet. 26: 87-90) 

pA04-2: contains the A. oryzae pyrG gene as described by De 
Ruiter- Jacobs, Y.M.J.T., et al. (1989. Curr. Genet. 16: 159-163) 

pA04-13: contains the A. oryzae pyrG gene as described by De 
10 Ruiter- Jacobs, Y.M.J.T., et al. (1989. Curr. Genet. 16:159-163) 

pUC19: as described by Yanisch-Perron, C, Vieira, J. and 

Messig, J. (1985, Gene 33:103-119) 

pDV8 : described in Example 8 and shown in Fig. 7. 

pJaL554: described in Example 8 and shown in Fig. 8 

15 

Deposit of Biological Materials 

The following biological material has been deposited under the 
terms of the Budapest Treaty with the Deutsche Sammlung von 
Microorganismen und Zellkulturen GmbH, Mascheroder Weg lb, D- 3 8 124 
20 Braunschweig, Germany and given the following accession number: 

Deposit Accession Number Date of Deposit 

Escherichia coli, pEES DSM 12294 1998-07-14 

Aspergillus niger 13PAP2 DSM 12298 1998-07-14 
The strain has been deposited under conditions that assure that 
25 access to the culture will be available during the pendency of this 
patent application. The deposit represents a substantially pure 
culture of the deposited strain. The deposit is available as required 
by foreign patent laws in countries wherein counterparts of the 
subject application, or its progeny are filed. However, it should be 
30 understood that the availability of a deposit does not constitute a 
license to practice the subject invention in derogation of patent 
rights granted by governmental action. 
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Example 1 

Cloning of the A. niger prtT transcriptional activator 

The prtT gene was cloned from 13PAP2, an A. niger mutant strain 
which is unable to express the amdS gene regulated by the pepA 
5 protease gene promoter and has a protease deficient phenotype (prt~) . 

Construction of the A. niger 13PAP2 reporter strain 

The plasmid pPAl was contructed by ligation of the following 
three fragments : 

10 1) the E. coli vector pBlueScript II SK (Stratagene Cloning 

Systems, La Jolla CA, USA) digested with EcoRI and Kpnl; 

2) a 1.4 kb EcoRI/BamHI restriction fragment containing the 1.2 
kb promoter region of the pepA gene linked to about 13 0 bp of the amdS 
coding sequence from the start codon to an internal BamHI site, 

15 amplified by PCR; and 

3) a 2.1 kb BamHl/Kpnl fragment from p3SR2 which contains most 
of the A. nidulans amdS gene 

Fragment 2 was constructed in two steps . In a first step genomic 
DNA from A. niger N402 prepared from protoplasts as described below in 
20 the section "Construction of the Cosmid Library" was used as the 
template, and the two oligonucleotides shown below, pepApr and 
pepA/amdS, were used as primers: 

PepApr: CGG AAT TCG CAT GCT GGA GGT GCT TCT AA (SEQ ID NO : 6) 
pepA/amdS: TTC CCA GGA TTG AGG CAT TTT GAC CAC GAG AAT (SEQ ID NO: 7) 

25 The 1200 bp PCR product obtained from this reaction was then used 

as a primer in a second PCR reaction together with the oligonucleotide 
MBL1213 shown below, and plasmid p3SR2 as the template. 
MBL1213: TAA CTT CCA CCG AGG TC (SEQ ID NO: 8) 

The product obtained by ligation of the three fragments described 

30 above was subsequently transfected into E. coli DH5a. 

In the final construction procedure, pPAl was digested with Not I 
and ligated to a 3.8 kb Not I fragment from pABPYRG* - Not , resulting in 
plasmid pPAP which is shown in Fig. 1. pPAP was transformed into A. 
niger AB 1.13, and a transformant with pPAP integrated into the pyrG 

35 locus in multicopy was isolated. A spontaneous 5 fluorotic acid (FOA) 
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resistant, uridine -requiring mutant of this transf ormant that could be 
complemented with the pyrG gene was named 13PAP2. 

Construction of pAopyrGcosArpl 

5 The plasmid pAopyrGcosArpl was constructed by ligation and 

subsequent transfection into E. coli DH5D of the following three 
fragments : 

1) the E. coli vector pHelpl cut with Acc65I and BamHI 

2) a 3.0 kb BamHI/Hindlll fragment from pAnscosl containing two 
10 cos sites 

3) a 3.2 kb Acc65I/HindIII fragment from pA04-2 containing the 
A. oryzae pyrG gene 

The resulting plasmid, pAopyrGcosArpl, is self -replicating in 
Aspergilli and can be selected for by growth on medium lacking 
15 uridine. pAopyrGcosArpl is depicted in Fig. 2. 

Construction of the cosmid library. 

A cosmid library of Aspergillus niger was constructed using the 
"SuperCosl cosmid vector kit" (Stratagene Cloning Systems, La Jolla 
20 CA, USA) according to the supplier's instructions. 

Genomic DNA from A. niger N402 was prepared from protoplasts made 
by standard procedures. 

After isolation the protoplasts were pelleted by centrif ugation 
at 2000 rpm for 10 minutes in a Beckman GS-6R; the pellet was then 
25 suspended in a buffer containing 22.5 mM tri-isonaphtalene sulphonic 
acid, 275 mM para-aminosalicylic acid, 0.2 M Tris-HCl (pH 8.5), 0.25 M 
NaCl and 50 mM EDTA immediately followed by addition of 1 volume of 
phenol/chloroform (1:1). After careful mixing and centrifugation at 
3000 rpm for 2 0 minutes the aqueous phase was decanted and DNA was 
30 precipitated using standard procedures. 

The size of the genomic DNA was analysed by electrophoresis on a 
0.3% agarose gel run for 2 0 hours at 3 0 volts at 4°C. The ethidium 
bromide stained gel showed that the recovered DNA ranged in size from 
50 to greater than 100 kb. The DNA was partially digested using Mbol . 
35 The size of the digested DNA was 30 to 50 kb as determined by the same 
type of gel analysis as above. The pAopyrGcosArpl vector, purified 
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using a kit from QIAGEN (Venlo, The Netherlands) following the 
manufacturer's instructions, was digested with BamHI, dephosphorylated 
and gel purified. Ligation and packaging were performed following 
standard procedures. 
5 After titration of the library, all of the packaging mix from a 

single ligation and packaging was transfected into the host cell, 
MC1046, and plated on 50 fig/ml ampicillin LB plates. Approximately 
40,000 colonies were obtained. Cosmid preparations from 10 colonies 
showed that they all had inserts of the expected size. The 40,000 
10 colonies were then soaked in LB medium and scraped off of the plates, 
then aliquoted for storage in 15% glycerol at -80°C. This represents 
an approximate 40-fold amplification of the A. niger genome. 

Selection of A. niger prtT clones 

15 Cosmid DNA was prepared from the library and introduced into 

13PAP2 according to the transformation procedure described by P.J. 
Punt and C. A. M.J.J. Van Den Hondel (1992. Methods Enzymol 216: 447- 
457) . Repeated efforts to select for the pyrG marker only resulted in 
a recovery of between 4000 to 30,000 transf ormants . A double 

20 selection for the pyrG marker and growth on medium containing 
acetamide as the sole nitrogen source resulted in a total of 65 
primary transf ormants from five different experiments. 

Each primary transformant was screened for protease activity, 
growth on medium containing acetamide as the sole nitrogen source and 

25 instability of these two characteristics. An acetamidase* phenotype, 
screened by growth on medium containing acetamide, is an indication of 
acetamidase activity resulting from activation of the pepA promoter in 
the reporter cassette in which the pepA promoter is linked to the amdS 
coding sequence. A protease + phenotype was screened using minimal 

30 medium plates containing dialyzed skim milk as the sole nitrogen 
source (Mattern, I.E., et al . , 1992. Mol Gen Genet 234:332-336). On 
these plates the wild-type AB4 . 1 strain makes a clear halo whereas the 
AB1.13 mutant produces a very small halo. This difference is not due 
to differences in the activity of pepA since a pepA deleted strain can 

35 also produce a large halo on these plates. Therefore, a large halo on 
milk plates indicates activation of other extracellular proteases. 
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Instability was tested by growing diluted spore stocks on medium 
containing uridine. Single-spore-derived colonies were picked from 
these plates and tested for protease activity and growth on acetamide. 
The screening results revealed that in more than 70% of the colonies 
5 both characteristics were lost. Therefore, the two phenotypes were 
either lost or retained together, indicating that activation of the 
pepA promoter and other protease promoters is coordinately regulated 
and linked to the presence of the pyrG marker. The gene responsible 
for this phenotype was named prtT. Twelve acetamidase + , protease + 
10 transformants were then isolated. 

Isolation of the A.niger prtT gene 

In order to rescue the prtT gene from the acetamidase + , protease" 
transformants of 13PAP2, DNA was prepared from mycelium grown in 

15 minimal medium as previously described. This DNA was used in an 
attempt to transform competent E. coli DH5D cells. Several hundreds 
of ampicillin-resistant colonies were obtained. DNA analysis showed 
they all contained sequences derived from the pHelpl plasmid. Cosmid 
DNA isolated from E. coli colonies was then retransf ormed into 13PAP2 . 

20 Two DNA samples gave rise to transformants, which showed both growths 
on acetamide containing medium and increased protease activity. DNA 
from one of the cosmids, ACR1, was then digested with several 
restriction enzymes. The resulting fragments were then co- transformed 
with pAopyrGcosArpl into strain 13PAP2. EcoRI, PstI, BamHI and Kpnl 

25 digestion of ACR1 gave rise to transformants capable of growth on 
acetamide and high protease activity, whereas Sail and Hindlll digests 
did not. Because EcoRI digestion gave the simplest pattern, separate 
EcoRI fragments were gel-isolated and with pAopyrGcosArpl used to co- 
transform 13PAP2. Only one fragment, a 15 kb EcoRI fragment, gave 

30 rise to transformants capable of growth on acetamide -containing 
medium. This fragment was subcloned in pBluescript II SK in order to 
subclone prtT from the cosmid. Since the insert of this clone was 
still rather large, separate PstI bands were gel isolated and each was 
co- transformed with pAopyrGcosArpl into 13PAP2. Only one band, a 2.5 

35 kb PstI fragment, gave rise to transformants that could grow on 
acetamide-containing medium. This fragment was subcloned in 
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pBlueScript II SK. Four subclones, C1E 0.7, C1E 1.8, NcE 1.1 and NcE 
1.4, were constructed from this plasmid based on the restriction map. 
In addition, a 6.5 kb Sstl/EcoRI fragment encompassing the 2.5 kb PstI 
fragment was subcloned, resulting in pEESl (shown in Fig. 3) . 
5 Southern blot analysis of genomic DNA from AB4 . 1 showed the 

presence of only one copy of prtT. 

Example 2 

Sequencing of the A. niger prtT gene and analysis of the sequence 

10 All sequence reactions were prepared using dRhodamine Terminator 

Cycle Sequencing Kits or BigDye™ Terminator Cycle Sequencing Kits from 
the Perkin- Elmer Corporation (Branchburg NJ, USA) . The reactions were 
run on an ABI PRISM® 377 DNA Sequencer (Perkin-Elmer Corporation) 
following the manufacturer's instructions. 

15 The prtT gene was sequenced from the genomic clones C1E 0.7, C1E 

1.8, NcE 1.1, NcE 1.4 and pEESl. The sequence specific primers used 
are listed below: 
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C (SEQ ID NO: 20) 


128330: 


CTA 


GCA 


GCG 


TAT 


CGG 


TCA 


GC (SEQ 


ID 


NO: 21) 


130887: 


CTT 


GGA 


AAA 


GAA 


ACG 


ATA 


G (SEQ ID NO: 22) 


130888: 


AAC 


GTA 


CGC 


TTT 


CCT 


CCT 


T (SEQ ID NO: 23) 


134135: 


GGG 


TCC 


GTC 


CAG 


TCC 


GTT 


CTT 


(SEQ ID 


NO: 24) 



-48 reverse: AGC GGA TAA CAA TTT CAC ACA GGA (SEQ ID NO: 25) 
35 -40 universal: GTT TTC CCA GTC ACG AC (SEQ ID NO: 26) 



42 



A mutant allele of the gene was obtained by PCR amplification of 
genomic DNA isolated from the mutant strain AB1 . 13 using the following 
primers : 

PstI: TC ATC CCT GGT GTT ACT GC (SEQ ID NO: 27) 
5 Pstll: C ATG GAT TGG CTG GCC G (SEQ ID NO: 28) 

The complete DNA sequence of the prtT gene is shown in SEQ ID NO: 
1. The sequence of the PCR fragment of the mutant allele is shown in 
SEQ ID NO: 4. 

Analysing the DNA sequence SEQ ID NO: 1 using the computer 
10 software Netgene 2 (S.M. Hebsgaard, P.G. Korning, N. Tolstrup, J. 
Engelbrecht, P. Rouze, S. Brunak (1996. Nucleic Acids Research 24: 
3439-3452) suggested the existence of 5 exons (see annotations to SEQ 
ID NO 1) . 

15 Analysis of the A. niger prtT cDNA 

mRNA was purified from total RNA (isolated according to the DNA 
isolation method described above in Example 1) using a commercial 
poly (A) + RNA isolation kit (Pharmacia, Uppsala SE) from a culture of A. 
niger grown under conditions favourable for protease production 
20 (J.P.T.W. Van Den Hombergh, et al . , 1997. Eur. J. Biochem. 247:605- 
613) . Double stranded cDNA was prepared using standard procedures and 
used for PCR reactions with the following primers: 
oligo-dT primer: T2Q N 



Prt270n: 


TACTCTCCAGATTGCCTG 


(SEQ 


ID 


NO: 


29) 


Prtl420r : 


TGAGATACCACTCAGCAG 


(SEQ 


ID 


NO: 


30) 


prtl350n: 


TGCACTTCTCTGTCTCTG 


(SEQ 


ID 


NO: 


31) 


Prt2365r: 


GACTTCTGGCATCAGTTG 


(SEQ 


ID 


NO: 


32) 


prt2320n: 


CTCATGGATGGCATGATC 


(SEQ 


ID 


NO: 


33) 



A PCR reaction with the primers Prt270n and Prtl420r produced a 
30 fragment of approximately 1.0 kb. The fragment was cloned into a 
pGEM-T vector (Promega Corp., Madison WI, USA), and the insert in the 
resulting plasmid was sequenced using the primers 122958, 122960, -40 
universal and -48 reverse. The result confirmed the presence of two 
introns in this part of the gene. 
35 A second PCR reaction with the primers Prtl350n and Prt2365r 

produced a fragment of approximately 0.9 kb. This fragment was also 
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cloned in a pGEM-T vector, and the insert in the resulting plasmid was 
sequenced using the primers 124289, 124290, -40 universal and -48 
reverse. The result confirmed the presence of a single intron in this 
part of the gene. 

5 Another PCR reaction with the oligo-dT primer and primer Prt2320n 

produced a fragment of approximately 350 bp. This fragment was also 
cloned in a pGEM-T vector. Sequencing of the insert using primers -40 
universal and -48 reverse showed that the fragment contained the 3' 
part of prtT and confirmed the presence of another intron. 

10 The deduced protein sequence of the translated prtT gene is shown 

in SEQ ID NO: 2. The deduced protein sequence of the translated mutant 
allele prtl3 is shown in SEQ ID NO: 5. A comparison of SEQ ID NO: 2 
and SEQ ID NO: 5 indicates that the only difference between the two is 
in position 112 where the leucine residue in the translated prtT gene 

15 is replaced by proline in the translated prtl3 gene. 

Analysis of the deduced PrtT protein sequence reveals the 
presence of a Zinc (II) 2Cys6 binuclear cluster DNA binding motif (SEQ 
ID NO: 2, residues 47-81) . This motif defines the GAL4 class of 
fungal transcriptional activators (Reece, M. J., and Ptashne, M. 1993. 

20 Science 261: 909-911). The presence of the motif in the prtT gene 
strongly indicates that prtT is a transcriptional activator. 

Example 3 

Disruption of the prtT gene in a wild- type A. niger strain 

25 A plasmid was constructed in which the upstream and downstream 

sequences of the prtT gene are separated by the A. oryzae pyrG gene. 
Plasmid pEESl was digested with Muni and Nhel, which removed a 2,1 kb 
fragment containing most of the coding sequence of prtT. A 2.3 kb 
EcoRI/Nhel fragment from pA04-13 containing the A. oryzae pyrG gene 

30 was cloned in the Muni and Nhel sites of pEESl . The resulting 
plasmid, shown in Fig. 4, was named pDprt . This construct was then 
used to transform A. niger strain AB4 . 1 to uridine prototrophy. About 
150 uridine prototrophic transf ormants were then analyzed for protease 
activity on skim milk containing plates. Five of these did not make a 

35 halo on these plates indicating that protease activity was very low. 



Comparison of strains with a disrupted prtT gene and the mutant AB1.13 
strain did not show any differences in protease activity or phenotype . 

Example 4 

Overexpression of A. niger PrtT 

A plasmid, pGPprt, (Figure 5) containing the coding region and 3' 
noncoding sequences of prtT fused to the promoter of the A. niger gpd 
gene was constructed. The gpd gene codes for glyceraldehyde-3- 
phosphate dehydrogenase, a const itutively expressed enzyme involved in 
primary metabolism. The promoter used was a fragment upstream of the 
coding region. 

The plasmid is transformed into A. niger AB4 . 1 by 
cotransformation with the pyrG selection plasmid pA04-13. 
Transf ormants with increased prtT transcription as determined by 
Southern blot analysis is analysed for increased protease expression. 

Example 5 

Isolation of the Zn 2+ - finger from the A. oryzae prtT gene 

The A. niger prtT gene is shown in SEQ ID NO: 1. The protein 
20 sequence deduced from the DNA sequence of prtT (SEQ ID NO: 2) 
contains a so called Zn 2+ - finger motif expected to be responsible for 
the DNA binding of the transcriptional activator encoded by prtT. The 
Zn 2+ -finger motif has the following amino acid sequence: Met Thr Ala 
Cys His Thr Cys Arg Lys Leu Lys Thr Arg Cys Asp Leu Asp Pro Arg Gly 
25 His Ala Cys Arg Arg Cys Leu Ser Leu Arg Ile^Asp Cys (SEQ ID NO: 34) . 

Degenerate primers able to code for amino acid sequences from the 
motif were designed and synthesized by DNA Technology A/S, 
Forskerparken, Gustav Wieds vej 10, DK-8000 Aarhus C, Denmark. The 
primers had the following sequences: 
30 137396: A T G A C C/T G C C/T T G C/T C A C/T A C C/T T G (SEQ ID NO: 
35) 

137397: A A/G A/G C A A/G/C/T C G A/G/C/T C G A/G C A A/G G C A/G T G 
(SEQ ID NO: 36) 

The primers were used in a PCR reaction with genomic A. oryzae 
35 IF04177 DNA as template. The reaction was performed in a total volume 
of 100 fil containing 154 pmol of primer 13 73 96 and 10164 pmol of 
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primer 137397. 30 PCR cycles with 56DC as annealing temperature and 30 
seconds elongation time were run. Another PCR reaction using A. niger 
genomic DNA and the primers 137394: ATGACTGCCTGTCACACATG (SEQ ID NO: 
37) and 137395: AGACAGCGACGGCACGCATG (SEQ ID NO: 38), which are 
5 specific for the A. niger prtT gene, was also run. In this reaction 10 
pmol of each primers was used in a 100 ptl reaction. Aliquots of the 
two reactions were applied to a 3% agarose gel. After electrophoresis 
three approximately equally intense bands could be seen in the A. 
oryzae reaction and two bands in the A. niger reaction. One of the 

10 bands in the A. niger reaction was more intense than the other and 
further had the expected size. One of the A. oryzae bands had the same 
size as the most intense A. niger band and was isolated from the gel. 
The fragment was cloned into the vector pCR2.1 (Invitrogen™) . Plasmids 
from two individual colonies were sequenced. The sequences are shown 

15 in figure 1. The two sequences differ at the end reflecting their 
origin in different degenerate primers. They are identical in the 
middle 40bp, which are amplified from the genomic DNA. These 40 
basepairs encode a polypeptide identical to a part of the Zn 2+ - finger 
of the A. niger prtT gene. 

20 

Example 6 

Isolation of the N- terminal of the A. oryzae prtT gene 

The inverse PCR method was used to isolate the A. oryzae prtT 
gene. The primers 144428: CACCGAGTTTTAAGCTTGCGG (SEQ ID NO: 39) and 

25 144429: CGATCTTGATCCACGAGGG (SEQ ID NO: 40) were synthezised by DNA 
Technology A/S (Denmark) . Genomic DNA was cut with a number of 
restriction enzymes and religated. The ligation mixtures were used as 
templates in PCR reactions with the primers 144428/144429. In a 
reaction with BamHI restricted and religated DNA as template a 

30 fragment of approximately 2.5kb was observed after electrophoresis on 
an agarose gel. The fragment was labelled with 32 P by the random 
priming method and used as a probe against a filter containing a 
gridded cosmid library of genomic A. oryzae DNA. The construction of 
the library is described in WO 98/01470. The cosmid 11F8 showed a 

35 positive hybridization signal with the probe. A Southern blot 
containing DNA from 11F8 and genomic DNA restricted with BamHI, EcoRI, 
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PstI or Xhol was probed with the 2.5kb inverse PCR fragment. The size 
of hybridizing bands from genomic DNA were compared with those from 
the cosmid DNA. Apparently some rearrangement of the cosmid had 
occurred since only a minority of the bands from the genomic DNA had 
5 counterparts in the cosmid. Two hybridizing fragments from the cosmid, 
a 1.2 kb EcoRI fragment and a 1.0 kb PstJ fragment, looked equal in 
size to hybridizing genomic fragments. The two fragments were sub- 
cloned from the cosmid and sequenced. Analysis of the sequence data 
showed that the fragments overlap. In total 1497 bp of sequence was 

10 obtained. Oligonucleotides encoding the Zn 2+ -finger were not contained 
within the sequence. A BamHI site was found close to one end of the 
sequence in a region only covered by the EcoRI sub-clone, thus 
allowing the position of the sequenced genomic fragment relative to 
the Zn 2+ - finger to be determined. 

15 The primer 153468: CGGGATGAATTGTAGAGAGGC (SEQ ID NO: 41) was 

prepared by DNA Technology A/S (Denmark) . The primer sequence is 
contained within the 1497 bp fragment. It is found at the end closest 
to the Zn 2+ -finger and points in that direction. Two primers both of 
prtT Zn 2+ -finger specific sequence and pointing either downstream 

20 (140358) or upstream (140359) were also prepared by DNA Technology A/S 
(Denmark). The sequence of the two primers are as follows: 140358: 
CGCAAGCTTAAAACTCGGTGCGATC (SEQ ID NO: 42) and 140359: 
CCTCGTGGATCAAGATCGCA (SEQ ID NO: 43) . Two PCR reactions, one with the 
primers 153468 and 140358 and one with 153468 and 140359, 

25 respectively, were performed with genomic DNA as template. The 
reaction with 153468 and 140359 gave a band of approximately 1.1 kb, 
the other reaction gave no visible bands, when analysed on an agarose 
gel. The 1.1 kb fragment was cloned into pCR4Blunt-TOPO (Invitrogen) 
and sequenced. The fragment contained part of the Zn 2+ - finger and 

30 overlaps with the 1497bp fragment. Translation of the sequence showed 
that the region immediately upstream of the Zn 2+ - finger encodes a 
polypeptide with homology to the N- terminal of prtT from A. niger. 
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Example 7 

Isolation of the complete A. oryzae prtT gene 

The remaining parts of the gene were cloned by two consecutive 
rounds of inverse PCR. In the first inverse PCR reaction the genomic 
5 DNA was restricted with EcoRV and re-ligated. The PCR reaction was run 
with the primers 175653: GATGAAAAGAATAATCGGCGAG (SEQ ID NO: 44) and 
175654: CGCGGCACACTACCCCCGTTG (SEQ ID NO: 45). The reaction resulted 
in the synthesis of a 1 . 9kb fragment, which was cloned into the 
pCR4Blunt-T0P0 vector and sequenced. Analysis of the sequence data 

10 showed that the fragment contains a gene with homology to the A. niger 
prtT gene and that the 3 "end of the gene was missing. The second 
inverse PCR reaction was thus performed. The primers were B0403G08: 
ATCTAGCTCAAGCATTAGCGGC (SEQ ID NO: 46) and B0403G09: 

AATTTCGGCCCTTTAGTGTCC (SEQ ID NO: 47) . Bglll restricted and re-ligated 

15 genomic DNA was used as template. A 2.4 kb fragment was obtained and 
cloned into the pCR4Blunt-TOPO vector and sequenced. Analysis of the 
sequence showed that the complete A. oyzae prtT gene had been 
obtained. The DNA sequence of the A. oryzae prtT gene is shown in SEQ 
ID NO: 48 and the deduced amino acid sequence of the encoded protein 

20 is shown in SEQ ID NO: 49. 

Example 8 

Disruption of the Aspergillus oryzae prtT gene 

The A. oryzae prtT gene was disrupted using a method of 
25 positive/negative selection. A disruption cassette consisting of 2kb 
of the A. oryzae prtT gene (SEQ ID NO: 48) with an insertion of the 
pyrG gene in the middle is cloned into a vector (pDV8) containing the 
herpes simplex virus I tymidine kinase gene (HSV-tk) flanked by fungal 
expression signals. Expression of the tymidine kinase gene makes the 
30 host sensitive to 5-f luoro-2-deoxyuridine . A disrupted strain can be 
isolated by positive selection for the pyrG gene in a pyrG minus host 
and deselection of the tymidine kinase gene on 5-fluoro-2- 
deoxyuridine . Since the tymidinie kinase gene and the pyrG gene are 
present in the same DNA fragment selection is for transf ormants in 
35 which a double cross -over event has happened. The system gives fewer 
transf ormants pr. Micro g of DNA than transformation with just a 
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disruption cassette, but the frequency of transf ormants in which the 
desired homologous recombination event has occurred is much higher. 

The pyrG gene used here is flanked by repeats enabling a later 
removal by selection for 5-f luoroorotic acid resistance. The pyrG gene 
5 is isolated from the plasmid pJaL554 . 

The pDV8 plasmid 

pDV8 was kindly provided by Matthew S. Sachs, University of 
Oregon, PO Box 91000, Portland, OR 97291-1000, USA. pDV8 (Fig. 7) is a 

10 pSP65 (Promega™) based plasmid containing the HSV-tk gene on a 1.2 kb 
Bglll/BamHI fragment inserted between a 1.0 kb Xhol/Bglll fragment of 
the A. nidulans gpd promoter and a 0.8 kb BamHl/Hindlll fragment 
containing the A. nidulans trpC transcriptional terminator. 

The A. nidulans gpd promoter and the trpC transcriptional 

15 terminator are taken from the plasmid pAN51-2 (Punt et al . , (1990), 
Gene 93, p. 101-109). The HSV-tk gene is described by McKnight S.L., 
(1980), Nucleic Acids Res. 8:5949-5964, Database accession no. EMBL 
v00470, position 252-1479. The construction of pDV8 is described in 
Vaught -Alexander D (thesis) Expression of the herpes simplex virus 

20 type-1 thymidine kinase gene in Neurospora crassa, (1994) , Oregon 
Graduate Institute of Science & Technology, University of Portland, PO 
Box 91000, Portland, OR 97291-1000, USA. The sequence of pDV8 is 
included in this application as SEQ ID NO: 50. Single-, double- and 
multicopy A. oryzae transf ormants of pDV8 were isolated by 

25 transforming a pDV8 derivative containing the A. oryzae niaD gene into 
an A. oryzae niaD mutant. The copy number of the HSV-tk gene was 
determined by Southern analysis. The transf ormants and the 
untransf ormed host were inoculated onto plates containing varying 
concentrations of 5-f luoro-2 ' -deoxyuridine . From inspection of the 

30 growth on the plates it was decided to use 6 microM of 5-fluoro-2'- 
deoxyuridine in the plates for future positive/negative selection 
experiments. At this concentration none of the pDV8 transf ormants 
grew, while the untransf ormed host was only slightly inhibited. 



49 



Description of pJaL554 

PJaL554 was constructed by ligating the 316 bp Asp718-Nhel 
fragment to the 5336 bp Spel-SspBI fragment from the pyrG containing 
plasmid pS02 (described in WO 97/35956) . Thus, pJaL554 harbours the A. 
5 oryzae pyrG gene flanked by 316 bp repeats. The construction is 
illustrated in Fig. 8. 

Construction of a prtT disruption plasmid in the pDV8 vector 

A PCR reaction is performed on chromosomal A. oryzae IF04177 DNA 
with the primers B1042E05 and B1450E07. 
B1042E05: CGCGCGTATCCTATTGCC (SEQ ID NO: 51) 
B1450E07: GCCGGAAATGTTGTACCTAC (SEQ ID NO: 52). 

A fragment of 2078 basepairs is obtained and cloned into the 
pCR4Blunt-TOPO ( Invitrogen™) vector. The resulting plasmid is 
sequenced with the standard M13 forward (-40) and reverse primers to 
ensure that the correct fragment is obtained. The PCR fragment is 
excised from the vector by the restriction enzyme EcoRV which cuts 
twice internally in the fragment. The cut sites are located at 
positions 1 and 1964 in SEQ ID NO: 48. The 1964 bp fragment is ligated 
with the pDV8 vector, which has been cut with Hindlll and blunt ended 
by filling in the ends with the Klenow fragment of DNA polymerase I 
from E. coli and dNTP. The resulting plasmid is cut with Hindi II , 
which is located in the prtT fragment (in the part encoding the Zn 2+ - 
finger) at position 962 in SEQ ID NO: 48, dephosphorylated and ligated 
with the pyrG gene isolated from pJaL554 as a 2.5 kb Hindlll fragment. 

Selection of prtT disrupted strains 

The disruption plasmid described above was linearized with NotI 
and transformed into A. oryzae HowBlOl (described in WO 97/3 5956) , a 
30 pyrG minus derivative of IF04177. The transformation is done essential 
as described in EP 0 238 023. Transf ormants are selected on plates 
containing Coves salt solution (Cove DJ, (1966), Biochim. Biophys. 
Acta 113:51-56), 1 M sucrose for osmotic stabilization and as carbon 
source, 20g/L agar, 10 mM NaN0 3 and 6 microM 5-f luoro-2-deoxyuridine 
35 (Sigma) . The transf ormants are reisolated once on the same type of 
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plates. Trans formants carrying a disrupted prtT gene are identified by 
Southern blot analysis. 

A strain carrying the prtT disruption is used as host for 
expression of a truncated PDI gene (Protein Disulfide isomerase gene) 
5 harbored on the expression plasmid pCaHj445 (described in US patent 
5,879,664). pCaHj445 is transformed into the A. oryzae prtT disrupted 
strain by cotrans format ion with the plasmid p3SR2 containing the A. 
nidulans amdS gene. Transformation and selection on acetamide plates 
is done essentially as described in EP 0 238 023. After reisolation 
10 the trans formants are fermented in shake flasks or fermentors and the 
PDI protein is purified from the fermentation broth. 

The invention described and claimed herein is not to be limited 
in scope by the specific embodiments herein disclosed, since these 

15 embodiments are intended as illustrations of several aspects of the 
invention. Any equivalent embodiments are intended to be within the 
scope of this invention. Indeed, various modifications of the 
invention in addition to those shown and described herein will become 
apparent to those skilled in the art from the foregoing description. 

20 Such modifications are also intended to fall within the scope of the 
appended claims . 

Various references are cited herein, the disclosures of which are 
incorporated by reference in their entireties. 
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